Delivery of different services through different client devices

ABSTRACT

A system that handles delivery of a service through a client device or a secondary device paired with the client device, includes an interactive service provider and the client device. The interactive service provider inserts at least one of digital watermarks, fingerprints, and trigger identifiers at event opportunities in media content. The client device detects at least one of the inserted digital watermarks, the digital fingerprints, and the inserted trigger identifiers in the media content. The client device further renders overlay graphics on the media content and activates at least one of input devices in vicinity of the client device or the rendered overlay graphics. The client device receives trigger responses over an activated overlay graphic, via the activated input devices. The client device further displays an interactive view on the client device, to enable delivery of services in response to the received trigger responses.

CROSS-REFERENCE TO RELATED APPLICATIONS/INCORPORATION BY REFERENCE

This Application makes reference to, and claims priority to, and claimsbenefit from U.S. Application Ser. No. 62/511,190, which was filed onMay 25, 2017.

This Application also makes reference to: U.S. application Ser. No.15/986,218, filed May 22, 2018.

Each of the above stated Patent Applications is hereby incorporatedherein by reference in its entirety.

FIELD OF TECHNOLOGY

Certain embodiments of the present disclosure relate to television andbroadcasting technologies for streaming media networks. Morespecifically, certain embodiments of the present disclosure relate todelivery of different services through different client devices.

BACKGROUND

Recent technological advancements in broadcasting and media deliverytechnologies have paved the way for promoters to target relevantaudiences across different media networks, such as linear networks, VODnetworks, and mobile networks, with promotional content of products orservices. Such media networks maintain a curated repository of mediacontent that is delivered to users across different media platformsunder ownership of the media network. The media content is distributedaccording to a schedule with slots dedicated to the promotional contentof products or services. Such slots can be placed between two segmentsof the media content or over a defined region of the media content. Theaudience that engages with the media content is served with suchpromotional content.

The media networks that are owners of the media content, provideviewership information of served promotional content to associatedpromotional networks. However, such viewership information obscures theintent or interest of target audience to subscribe to the products orservices upon watching associated promotional content. Additionally,promoters that prefer to target certain audiences for granular periodsof time in a day, require intent or interest of a user in associatedproduct or services for such granular periods of time. For example, arestaurant chain may wish to target audiences at different periods oftime, such as before breakfast time, lunch time, and dinner time, toraise possibility of users to purchase products items from therestaurant. Moreover, such promoters may request to improve the intentof target audience to purchase promoted products or services, which maybe a technically challenging task.

Currently, the ability to measure intent and further improve the intentin a way that encourages each target audience member to purchase productitems is performed via detection of impressions or clicks on a promotedmedia content. However, with varying attention span of audience members,such impressions or clicks result in lower conversion rates. Theimpressions or clicks are considered an imperfect measure of intent in atime when audience members have a transient attention span. The currenttechnological solutions are inefficient to measure the intent orinterest of users to purchase products or services that are promotedthrough the promotional content. The transient attention span of atarget audience affects the conversion of the target audience. Thus,advanced systems may be required that may transform that transientattention to selective sustained attention for viewed promotionalcontent.

Further limitations and disadvantages of conventional and traditionalapproaches will become apparent to one of skill in the art, throughcomparison of such systems with some aspects of the present disclosureas set forth in the remainder of the present application with referenceto the drawings.

BRIEF SUMMARY OF THE DISCLOSURE

Systems and methods are provided for delivery of different servicesthrough different client devices, substantially as shown in and/ordescribed in connection with at least one of the figures, as set forthmore completely in the claims.

These and other advantages, aspects, and novel features of the presentdisclosure, as well as details of an illustrated embodiment thereof,will be more fully understood from the following description anddrawings.

BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS

FIG. 1 is a block diagram that illustrates an exemplary networkenvironment for delivery of different services through different clientdevices, in accordance with an exemplary embodiment of the disclosure.

FIG. 2 is a block diagram that illustrates an exemplary triggermanagement system that handles insertion and activation of triggeridentifiers in media content received from video service providers, inaccordance with an exemplary embodiment of the disclosure.

FIGS. 3A and 3B, collectively depict a first exemplary scenario thatillustrates a client-side presentation of overlay graphics and receiptof trigger response over presented overlay graphics, via an inputdevice, in accordance with an embodiment of the disclosure.

FIG. 4 depicts a second exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of trigger response overpresented overlay graphics, via a trigger input rendered on a secondarydevice paired with client device, in accordance with an embodiment ofthe disclosure.

FIG. 5 depicts a third exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of trigger response overpresented overlay graphics based on a direct user selection of presentedoverlay graphics, in accordance with an embodiment of the disclosure.

FIG. 6 depicts a fourth exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of voice responses overpresented overlay graphics, handles by a smart conversational agent, inaccordance with an embodiment of the disclosure.

FIGS. 7A to 7E are flowcharts that collectively illustrate a method fordelivery of different services through different client devices, inaccordance with an embodiment of the disclosure.

FIG. 8 is a conceptual diagram that illustrates an example of a hardwareimplementation for the system of FIG. 1 that employs a processing systemfor delivery of different services through different client devices, inaccordance with an exemplary embodiment of the disclosure.

DETAILED DESCRIPTION OF THE DISCLOSURE

Certain embodiments of the disclosure may be found in a system andmethod that handles delivery of a service through a client device or asecondary device paired with the client device. Examples of thedifferent services that may be delivered through the client device 138,may include, but are not limited to an (over-the-top) OTT servicethrough internet, a service that can be delivered through broadcastnetworks on different client devices, such as (Advanced TelevisionSystems Committee) ATSC smart TVs (e.g., ATSC 3-enabled TV), presentable(visual, aural, etc.) services, consumable services, online services,purchase service, or other access independent services or dependentservices. Also, different client devices, through which differentservices may be delivered, may be an ATSC smart TV capable of readingwatermarks for a broadcast implementation, a set top box, aninternet-based television, a smartphone, a tablet, and the like. Insteadof an OTT service, different services may be delivered via an enabledset-top, ATSC3 TV, or Smart TV with Automatic Content Recognition (ACR),or a legacy TV with a smart second device that is cable of decodingaudio watermarks in the media content.

Various embodiments of the disclosure include a system that may includean interactive service provider and the client device. The interactiveservice provider may include one or more processors and the clientdevice may include a processor. The one or more processors at theinteractive service provider may be configured to insert at least one ofa plurality of digital watermarks, a plurality of fingerprints, and aplurality of trigger identifiers at a plurality of event opportunitiesin media content. The media content may be prepared by a video serviceprovider that is different from the interactive service provider. Theinsertion may be executed by the interactive service provider via afirst integration with the video service provider. The media content mayinclude programming media content and non-programming media content andthe plurality of event opportunities in the media content may correspondto a plurality of candidate time intervals in a playback duration of themedia content.

The processor in the client device may be configured to detect at leastone of the inserted plurality of digital watermarks, the plurality ofdigital fingerprints, and the inserted plurality of trigger identifiersin the media content in a playback duration of the media content at theclient device. The detection may be instructed by the interactiveservice provider via a second integration with the client device. Theprocessor at the client device may be further configured to render aplurality of overlay graphics on the media content for the plurality ofcandidate time intervals in the media content. The processor at theclient device may be further configured to activate, via the secondintegration, at least one of one or more input devices in vicinity(i.e., a defined proximal range) of the client device or the renderedplurality of overlay graphics. One or more trigger responses may befurther received, by the processor in the client device, over anactivated overlay graphic rendered on the media content in the playbackduration, via the activated one or more input devices. The processor atthe client device may be further configured to display an interactiveplatform (e.g., an interactive interface) on the client device, toenable delivery of at least one service in response to the received oneor more trigger responses. The delivery of the services may be enabledat the client device in communication with a fulfillment server thatfulfills the delivery of the services to increase a user engagement toselect the services. For example, the services may include at least oneof a direct payment, a direct transaction, or a direct notification forselected at least one product offering or at least one service offering,and a direct update of the selected at least one product offering or theat least one service offering on a shopping cart, a personalized list,or a social platform.

In accordance with an embodiment, the one or more processors may befurther configured to receive, from the video service provider, themedia content that includes programming media content andnon-programming media content, Also, the one or more processors may befurther configured to receive a request for analysis of the mediacontent from the video service provider. The analysis may include atleast one digital fingerprinting and digital watermarking of theprogramming and/or non-programming media content.

In accordance with an embodiment, the one or more processors may befurther configured to fingerprint the programming media content and thenon-programming media content and further generate fingerprintinformation for the programming media content and the non-programmingmedia content. The fingerprint information may include at least one ofacoustic fingerprint information and video fingerprint information fordifferent segments of the media content, where such different segmentsmay include audio frames or image frames of the media content.

In accordance with an embodiment, the plurality of digital watermarksmay be inserted into an audio portion or a video portion of the mediacontent, at the plurality of event opportunities in the media content.The one or more processors may be further configured to generatewatermark information that may include metadata for at least one of anacoustic watermark or a video watermark, inserted between differentsegments of the media content.

In accordance with an embodiment, each event opportunity may correspondto a specified event in the media content and the specified event maycorrespond to at least one of start credits, end credits, lull points,time-based marked regions, location-based marked regions, context-basedmarked regions, explicit scene start and end, and product placementopportunities, within the media content. Each trigger identifier of theplurality of trigger identifiers may be at least one, but not limitedto, of Society of Cable and Telecom Engineers (SCTE) 35 triggers,SCTE-104 triggers, .m3u8 playlists, .mpd manifest tags, Nielsen ID3tags, inaudible beacons, image beacons, and data beacons.

In accordance with an embodiment, the processor in the client device maybe further configured to pair up at least one of one or more inputdevices available in vicinity of the client device or the secondarydevice. The one or more input devices may be paired via at least one ofa Bluetooth network, a Wi-Fi network, an internet-based network, a wiredlocal network, or a wireless ad hoc network. In some embodiments, avirtual voice assistant may be enabled on the input device, or at leastone of the client device or the secondary device. The input device,which is enabled with the virtual voice assistant, may be configured toactively wait and listen for a trigger identifier in an audio portion ofthe media content.

In accordance with an embodiment, the processor in the client device maybe further configured to instruct the virtual voice assistant to detectat least one beacon within the playback duration of the programmingmedia content/watermark/trigger recognizer. The processor in the clientdevice may be further configured to facilitate a conversation with auser associated with the client device, to receive a trigger responsefrom the user. The trigger response may be received through a user'sspeech input.

In some cases, some virtual voice assistants may only be loosely (orselectively) paired to the client device based on a proximity and anability to hear the audio from the client device. It may be theinteractive service provider (or a partner) that may configure theclient device to detect triggers. The user may have to accept or approvethe configuration through a secondary device (e.g., a smartphone) thatis different from the client device (e.g., a smart TV) that receives theprepared media content from the video service provider. Alternativelystated, a voice assistant application installed in a secondary device(in vicinity of the client device) may include (or further update orinstall) an voice assistant service that may offer relevant offers fromthe interactive service provider (or other partners) while the user isengaged in watching the media content played on the client device. Also,the input device may be loosely paired (or selectively paired forcertain specific functionality) or even unpaired with the client device,and therefore, the input device may listen (to recognize) for a triggeridentifier in the audio portion of the media content automatically basedon instructions from the interactive service provider.

In accordance with an embodiment, the non-programming media content mayinclude promotional media content for at least one of a product offeringor a service offering. The promotional media content may include atleast one of graphical content, textual content, video content, oranimated content.

In accordance with an embodiment, the processor in the client device maybe further configured to execute a check to decide whether to activatethe one or more paired input devices from a set of input devices, therender plurality of overlay graphics on the media content. The decisionis based on the check executed on a defined criteria that is associatedwith records of previous service requests for the plurality of eventopportunities on which the at least one service was delivered in past.The defined criteria may include at least one of a user-definedconstraint, an inventory constraint, and a specified threshold count oftrigger responses for a product offering or a service offering promotedby the non-programming media content.

In accordance with an embodiment, the one or more processors may befurther configured to deliver notifications on at least one of theclient device or the secondary device paired with the client device, inresponse to the received one or more trigger responses. The one or moretrigger responses may correspond to one or more user interactions thatmay be received based on at least one of a touch input, a gesture input,a haptic feedback input, and a voice command input.

FIG. 1 is a block diagram that illustrates an exemplary networkenvironment for delivery of different services through different clientdevices, in accordance with an exemplary embodiment of the disclosure.FIG. 1 shows a network environment 100 that includes a contentproducer/primary distributor 102, which may include an ad agency 104, astudio 106, and a pre-produced master content store 108. There isfurther shown a video service provider 110, which may include a usermanager 112, a programming content store 114, a media encoder/packager116, and a trigger insert software development kit (SDK) 118. Aninteractive service provider 120 may include a trigger management system122, a non-programming media content store 124, a media analysis system126, and a user manager 128. The trigger management system 122 mayinclude a fingerprinting system 130 and a watermarking system 132.Similarly, the non-programming media content store 124 may include agraphic overlay manager 134 and a watermark/fingerprint/trigger registry136.

The network environment 100 may further include a client device 138, anda secondary device 140. In some embodiments, a display 142 may be anexternal device coupled to the client device 138 in the networkenvironment 100. In other embodiments, the display 142 may be integratedwith the client device 138. The network environment 100 may furtherinclude a content/watermark/trigger recognizer 144, a trigger client SDK146, and interactive service provider partners 148, which may include anoverlay ad server 150, and a fulfillment server 152. The aforementionedcomponents in the network environment 100 may be communicatively coupledto each other, via a communication network 154.

The content producer/primary distributor 102, the video service provider110, the interactive service provider 120, the client device 138, andthe interactive service provider partners 148 may together be a part ofa system of networks, devices, and/or circuitries, interconnected toeach other via the communication network 154. Although not shown in FIG.1, the interactive service provider 120 may include one or moreprocessors that may be configured to execute all the operations of theinteractive service provider 120 and similarly, the client device 138may include a processor that may be configured to execute all theoperations of the client device 138.

The content producer/primary distributor 102 may be a server or adistributed network of servers that may be managed (or owned) by aproducer entity or a primary distributor entity. The contentproducer/primary distributor 102 may store and distribute media content,which includes programming media content (e.g., television shows, liveprograms, trailers, movies, etc.) as well as non-programming mediacontent (e.g., advertisement videos, graphic advertisements (e.g., bugs,logos, banners, tickers, popups, ⅓^(rd) graphics, etc.), promotionalvideos, and promotional audio, etc.). Also, the content producer/primarydistributor 102 may prepare, package, and distribute the media content(programming and/or non-programming) to different networks that servethe media content to different client devices, through a secure (oropen) content delivery network. The content producer/primary distributor102 may include the pre-produced master content store 108 that may becoupled to the ad agency 104 and the studio 106.

The ad agency 104 may be a content producer that creates, plans, andhandles different tasks associated with production and distribution ofmedia content (which may be advertisements or other promotionalcontent). The ad agency 104 may produce commercial videos, radiocommercials, online advertisements, out-of-home advertisements, and thelike, based on an advertising campaign that may be specified by aproduct/service promoter or owner.

The studio 106 may be a platform that facilitate content producers tocreate, edit, curate, and/or package different types of media contentfor distribution to different programming networks (e.g. the videoservice provider 110) that stream programming media content to differentclient devices (e.g., the client device 138). In some embodiments, thestudio 106 may be an application platform that may be configured toproduce media content and curate the produced media content fordistribution to different programming networks.

The pre-produced master content store 108 may be a media server or adistributed network of media servers, which may be a part of a contentdelivery network for producers/distributors of the media content. Thepre-produced master content store 108 may be configured to receiveproduced and/or curated media content (e.g., video ads, promotionsvideos, overlay graphics (e.g., TV shows, live shows, movies, ad videos,bugs, logos, ⅓^(rd) graphics, banners, etc.,), promotional audio, etc.)from the ad agency 104 and the studio 106. The pre-produced mastercontent store 108 may be further configured to maintain a curatedrepository of non-programming media content that is received from the adagency 104, the studio 106, or other content production and distributionplatforms. The pre-produced master content store 108 may also storeprogramming content that may need to be analyzed and have triggeridentifiers inserted to mark product placement, e.g., a positivesentiment beach scene over which non-programing content may be overlaidfor certain client devices.

The video service provider 110 may be a distribution network (e.g., atelevision network or a media streaming network) that handlesproduction, curation, and distribution of the media content that may beprepared based on programming and/or non-programming media content,which is received from the content producer/primary distributor 102, orother sources. The video service provider 110 may be a third partydistribution network that may be further configured to store programmingmedia content along with other programming or non-programming assets(e.g. the media content received from the content producer/primarydistributor 102). The distribution of the media content is managed viadifferent delivery channels that may be managed by the video serviceprovider 110, for example, broadcast channels, internet-based mediastreaming applications, web channels, podcast channels, radio channels,and the like. The video service provider 110 may include the usermanager 112, the programming content store 114, the mediaencoder/packager 116, and a trigger insert SDK 118.

The user manager 112 may comprise suitable logic, circuitry, andinterfaces that may be configured to manage processes and requests thatmay be associated with user preferences, user activities (or userfootprints), user selections, or user profiles across different clientdevices (e.g., the client device 138). Such user preferences, useractivities, user selections, or user profiles may be stored and managedas user-related data in a user database (not shown in FIG. 1).

The user database may comprise suitable logic, circuitry, and interfacesthat may be configured to store and manage the user-related data in realtime. The user-related data may comprise at least a set ofuser-authentication data, a subscription data, user preferences, userattributes (e.g., demographic profile (e.g., age, gender, location,income details, etc.) and user footprints (e.g., cookies-based data)that defines user activities on applications/client devices associatedwith a user. The user-related data may be received as structured andrelational or unstructured and non-relational data from one or more datasources. Every time a user request is received by the video serviceprovider 110 to stream the programming media content to a client device(e.g., the client device 138), the user request may be communicated, viathe communication network 154, to the user manager 112. In certainimplementations, the user manager 112 may be configured to facilitatetransactions or payments made on options delivered over the programmingmedia content from the video service provider 110, in conjunction with atransaction system.

The programming content store 114 may be a media server or a distributednetwork of media servers, which may be a part of a programming contentdelivery network that is handled by the video service provider 110. Theprogramming content store 114 may be a curated repository of differentprogramming media assets that may be stored along with non-programmingmedia content. In case a user request for programming media content isreceived, the programming content store 114 may be configured to deliverdifferent programming media assets (e.g., episodes of different shows,movies, trailers, video ads, offers etc.) along with non-programmingmedia content (e.g., overlay graphics, or other promotional content,etc.) to the media encoder/packager 116.

The media encoder/packager 116 may comprise suitable logic, circuitry,and interfaces that may be configured to package different programmingmedia assets to obtain media content for delivery to the client device138. In some cases, the media content may be further prepared using thenon-programming media content (e.g., overlays) at different eventopportunities in the media content. The media encoder/packager 116 maybe further configured to insert a plurality of trigger identifiers(e.g., Society of Cable and Telecom Engineers (SCTE-35) triggers,SCTE-104 triggers, Nielsen ID3 tags, inaudible beacons, data beacons,etc.) based on integration with the trigger insert SDK 118. Theplurality of trigger identifiers may be further utilized to triggerdifferent user options at client devices (e.g., the client device 138)).The media encoder/packager 116 may be further configured to encode themedia content and encapsulate the encoded media content in a mediastream (e.g., a media stream that includes H.264 video stream and AACaudio stream or an MPEG-2 TS streams). Such media stream may then bedelivered to different client devices (e.g., the client device 138) viathe communication network 154.

The trigger insert SDK 118 may be an instruction-based platform for aspecific application, for example, a controlled insertion of a pluralityof trigger identifiers in media content at different event opportunitiesby an external service provider (e.g., the interactive service provider120). The trigger insert SDK 118 may be operational in the mediaencoder/packager 116 of the video service provider 110. The triggerinsert SDK 118 may include a development user interface (UI), a set ofroutines, calls, application programming interfaces (APIs), or othertools, which may be accessible by at least the video service provider110 and the interactive service provider 120. By integrations with thetrigger insert SDK 118, the interactive service provider 120 maydirectly insert trigger identifiers at different event opportunities inthe media content.

The interactive service provider 120 may be a service network (e.g., anaffiliate network) that may include a server or a distributed network ofservers executing different operations on media content (programmingand/or non-programming) for different video service providers (e.g., thevideo service provider 110). Different operations may be executed basedon requirements of different video service providers. The requirementsmay include at least one of media content (programming ornon-programming) analysis, watermarks insertion, fingerprintsgeneration, and trigger identifiers insertion. The media content may bereceived from different service providers (e.g., the video serviceprovider 110) and a modified version of the media content (afterexecution of different operations) may be transmitted back to intendedvideo service providers.

The trigger management system 122 may comprise suitable logic,circuitry, and interfaces that may be configured to fingerprint,watermark, and/or schedule insertion of different trigger identifiers inmedia content (programming and/or non-programming) received from thevideo service provider 110. The fingerprints, watermarks, or triggeridentifiers may be utilized at the client device 138 to presentinteractive options over programming or non-programming media content(e.g., promotional graphics) served at the client device 138. Thetrigger management system 122 may be implemented as a computationalcircuitry or as a set of instructions that are executed on acomputational circuitry on one or more servers of the interactiveservice provider 120.

The non-programming media content store 124 may be a server, adistributed network of servers, or a set of applications that areexecuted on the server or the distributed network of servers. Theoperations of the non-programming media content store 124 may be handledby the interactive service provider 120. The non-programming mediacontent store 124 may be configured to manage overlay of graphicalassets at different client devices (e.g., the client device 138) andfurther maintain a database of watermark information, fingerprintinformation, or event information associated with each triggeridentifier for media content (programming or non-programming) receivedfrom the video service provider 110.

The media analysis system 126 may comprise suitable logic, circuitry,and interfaces that may be configured to determine the one or more eventopportunity points within the media content (programming ornon-programming) that is received from a third party video serviceprovider (e.g., the video service provider 110). The media analysissystem 126 may be further configured to identify information associatedwith different segments (e.g., specific portions, specific scenes, PODdurations, commercial breaks, etc.) in the media content. Theinformation may be identified based on analysis of a context, a userpreference, or a defined goal (associated with flights of anadvertisement campaign) of different segments in the media content(programming and/or non-programming). Such information may includedifferent attributes, for example, a duration, a tag (or metadata), asuitable overlay asset for an event opportunity, a total number of eventopportunities in a segment, and the like. The determined one or moreevent opportunity points and associated information may be transmittedto the trigger management system 122 and the non-programming mediacontent store 124.

The user manager 128 may be similar to the user manager 112 of the videoservice provider 110. Therefore, the details of the user manager 128 hasbeen omitted from the disclosure for the sake of brevity. In some cases,however, the user manager 128 may include separate databases, with eachseparate database configured to store user-related data from a specificvideo service provider (e.g., the video service provider 110).

The fingerprinting system 130 may comprise suitable logic, circuitry,and interfaces that may be configured to fingerprint an audio portionand/or a video portion of the media content (programming and/ornon-programming) that is received from a specific video service provider(e.g., the video service provider 110). The fingerprinting system 130may also fingerprint the programing content to serve as a triggeringmechanism for different overlay (or event) opportunities in the mediacontent. The fingerprinting system 130 may generate fingerprintinformation, which may include at least one of acoustic fingerprintinformation and video fingerprint information for different segments (orselected audio or image frames) of the media content (programming and/ornon-programming). The generated fingerprint information for an asset(programming and/or non-programming) may be utilized to uniquelyidentify and detect playback of the asset (programming and/ornon-programming) at different client devices (e.g., the client device138).

The watermarking system 132 may comprise suitable logic, circuitry, andinterfaces that may be configured to insert digital watermarks(perceptible or imperceptible) into an audio portion and/or a videoportion of the media content (programming and/or non-programming)received from a specific video service provider (e.g., the video serviceprovider 110). The watermarking system 132 may generate watermarkinformation, which may include metadata for at least one of an acousticwatermark or a video watermark inserted between different segments (orselected audio or image frames) of the media content (programming and/ornon-programming). The generated watermark information for an asset(programming and/or non-programming) may be utilized to uniquelyidentify and detect playback of the asset (programming and/ornon-programming) at different client devices (e.g., the client device138). The watermarking system 132 may also watermark the programingcontent to serve as a triggering mechanism for different overlayopportunities in the media content.

The graphic overlay manager 134 may comprise suitable logic, circuitry,and interfaces that may be configured to manage an overlay scheduledatabase for a repository of overlay graphics (e.g., logos, bugs,banners, tickers, ⅓^(rd) graphics, sliders, etc.). The overlay schedulemay be delivered to the graphic overlay manager 134 by different contentproducers/primary distributors (e.g., the content producer/primarydistributor 102). The overlay schedule may include details of what tooverlay, when to overlay, for what duration to overlay and how thatoverlay should be presented at different devices (e.g., the clientdevice 138).

The watermark/fingerprint/trigger registry 136 may comprise suitablelogic, circuitry, and interfaces that may be configured to manage adatabase or a set of databases for different types of informationreceived from the fingerprinting system 130, the watermarking system132, and/or a trigger scheduler (not shown in FIG. 1). The differenttypes of information may include fingerprint information, watermarkinformation, and event information for each portion (or asset) inreceived media content (programming and/or non-programming) from thevideo service provider 110. Such types of information may be utilized tosearch for metadata and details of an event trigger that is detected atdifferent client devices (e.g., the client device 138). Also, such typesof information may be utilized to tracks watermarks associated withprogramming media content. The event trigger may correspond to an eventopportunity, details of which may be specified in thewatermark/fingerprint/trigger registry 136.

The client device 138 may comprise suitable logic, circuitry, andinterfaces that may be configured to request the video service provider110 to stream programming media content at the client device 138. Theclient device 138 may also request the video service provider 110 tostream programming media content (e.g., TV shows, live matches, movies,etc.) along with further requests to the interactive service provider120 for activation of trigger inputs (overlaid graphical buttons orphysical input devices). The requests may be associated with a userintention to consume/engage/interact (e.g., purchase, sell, bid, share,etc.) with the non-programming media content presented along with theprogramming media content. Such trigger inputs may be activated forvarious event opportunities in the media content, for example,television programming or on-demand shows (e.g. dramas, movies,advertisements, contextually relevant product placement scenes, etc.),at a display view of the client device 138. Examples of the clientdevice 138 may include, but are not limited to, a Set-Top-Box (STB), an(internet protocol television) IPTV, a hybrid box, a cable converter, asmartphone, a laptop, and a tablet, a smart TV, a digital media player(for example, “Fire” TV, or other brand-based TV, etc.). Also the clientdevice 138, through which different services may be delivered, may be anATSC smart TV capable of reading watermarks for a broadcastimplementation, a set top box, an internet-based television, asmartphone, a tablet, a set-top box, an ATSC3 TV, or a Smart TV with ACR(automatic content recognition), or a legacy TV with a smart seconddevice that is cable of decoding an audio watermark.

In some embodiments, the client device 138 may facilitate playback ofthe programming media content via the display 142 that may beperipherally integrated with the client device 138. Examples of thedisplay 142 may include, but not limited to a television display (e.g.Liquid Crystal Display (LCD), Light Emitting Diode (LED) display,organic light emitting diodes (OLED) display, plasma display, Thin-FilmTransistor (TFT) display, etc.) a projector screen, a computer monitor,a three dimensional (3D) display, and an (Augmented Reality/VirtualReality (AR/VR display).

The secondary device 140 may comprise suitable logic, circuitry, andinterfaces that may be configured to display trigger inputs (overlaidgraphic buttons) in response to an event trigger that occurs based on adetection of a trigger identifier with the playback of the media contentat the client device 138. Additionally, the secondary device 140 may beconfigured to display notifications, payment options, or, promotionalcontent, via at least one delivery platform accessible on the secondarydevice 140. The one or more delivery platforms may correspond to acommunication exchange platform that may incorporate visual, audible, orexecutable data to serve notifications or associated promotional contentonto the secondary device 140. Examples of the one or more deliveryplatforms may include, but not limited to an email platform, a mobilemessaging platform, a webpage, a podcast, and a push notification.Examples of the secondary device 140 may include, but not limited tosmartphones, tablets, phablets, smart watches, smart glasses, and smartspeakers/with or without artificially intelligent (AI) conversationalagents. In some cases, the secondary device 140, for e.g., a smartconversational agent, may be configured to detect the audio watermarkitself and may not rely on the client device 138 for detection.

The content/watermark/trigger recognizer 144 may comprise suitablelogic, circuitry, and interfaces that may be configured to identifytrigger identifiers, inserted watermarks, or media content based onfingerprints at the client device 138, based on integration with theinteractive service provider 120, via the trigger client SDK 146.Alternatively stated, the content/watermark/trigger recognizer 144 maydetect and/or identify information associated a trigger identifier, aninserted watermark, or a fingerprint while the programming media content(along with non-programming media content) is played at the clientdevice 138. Along with trigger client SDK 146, thecontent/watermark/trigger recognizer 144 may communicate with theinteractive service provider 120, via a set of calls or otherinstructions (e.g., API calls), to identify information (e.g., eventinformation, watermark information, fingerprint information, etc.) inresponse to a detection of an event trigger at the client device 138.

The trigger client SDK 146 may be an instruction-based platform for aspecific application, for example, an instruction based identificationof information (e.g., event information, watermark information,fingerprint information, etc.) in response to a detection of an eventtrigger at the client device 138. The trigger insert SDK 118 may beoperational, at the client-side, i.e., at the client device 138, in thecontent/watermark/trigger recognizer 144. Similar to the trigger insertSDK 118, the trigger client SDK 146 may include a development UI, a setof routines, calls, APIs, or other tools, which may be accessible by atleast the content/watermark/trigger recognizer 144 and the client device138. By integrations with the trigger client SDK 146, the interactiveservice provider 120 may directly communicate certain information (e.g.,event information, watermark information, fingerprint information, etc.)in response to a detection of an event trigger at the client device 138.

The interactive service provider partners 148 may be a group of servicenetworks (e.g., different affiliate networks) that may have partnershipsand affiliations with the video service provider 110 and the interactiveservice provider 120. The interactive service provider partners 148 mayinclude a server or a distributed network of servers that serve overlayadvertisement graphics (or media content) to the client device 138 inresponse to request from the client device. Also, the differentoperations on media content (programming or non-programming) fordifferent video service providers (e.g., the video service provider110). Different operations may be executed based on requirements ofdifferent video service providers.

The overlay ad server 150 may comprise suitable logic, circuitry, andinterfaces that may be configured to design or produce and distributeoverlay graphics along with associated metadata to the client device 138in response to requests received from the client device 138. Arepository of overlay graphics (e.g., bugs, logos, tickers, banners,sliders, ⅓^(rd) graphics, full screen graphics, etc.) may also bemaintained at the overlay ad server 150.

The fulfillment server 152 may comprise suitable logic, circuitry, andinterfaces that may be configured to deliver a service (e.g., paymentservice, order service, bid/reverse bidding service, registration,subscription, social sharing, etc.) to a user in response to a requestfrom different client devices, such as the client device 138.

The communication network 154 may comprise suitable logic, circuitry,and interfaces that may be configured to manage a plurality of networkports and a plurality of communication channels for transmission andreception of communication data (for e.g. media streams, request data,metadata, database files, web data, etc.). The communication data maycorrespond to data received and/or exchanged among the contentproducer/primary distributor 102. The video service provider 110, theclient device 138, the secondary device 140, thecontent/watermark/trigger recognizer 144, the trigger client SDK 146,and interactive service provider partners 148, via the communicationnetwork 154. Each network port may correspond to a virtual address (or aphysical machine address) for transmission and reception of thecommunication data. For example, the virtual address may be an InternetProtocol version 4 (IPV4) or an Internet Protocol version 6 (IPV6)address, and the physical address may be a media access control (MAC)address. An application layer may be associated with the communicationnetwork 154 for implementation of communication protocols on one or morecommunication requests from at least one of the one or more computingdevices

Examples of the communication protocol may include, but not limited to aHypertext Transfer Protocol (HTTP), a File Transfer Protocol (FTP), aSimple Mail Transfer Protocol (SMTP), a Domain Name Server (DNS)protocol, and a Common Management Information Protocol (CMIP) OverTransmission Control Protocol/Internet Protocol TCP/IP (CMOT).

Examples of the communication channels may include, but are not limitedto a wireless channel, a wired channel, a combination of wireless andwired channel thereof. The wireless or wired channel may be associatedwith a data standard which may be defined by one of a Local Area Network(LAN), a Personal Area Network (PAN), a Wireless Local Area Network(WLAN), a Wireless Sensor Network (WSN), a Wireless Area Network (WAN),and a Wireless Wide Area Network (WWAN). Additionally, the wired channelmay be selected on the basis of bandwidth criteria. For example, anoptical fiber channel may be used for a high bandwidth communication,and a coaxial cable (or Ethernet-based communication channel) may beused for moderate bandwidth communication.

In one scenario, the communication network 154 may be an internet basednetwork. For such networks, a delivery standard may be used, forexample, National Television System Committee (NTSC), (Phase AlternatingLine) PAL, Sequential Color with Memory (SECAM), Moving Picture ExpertsGroup (MPEG), and (Digital Video Broadcasting) DVB-S/DVB-S2 or(Integrated Services Digital Broadcasting) ISDB-S. Although, only asingle communication network has been illustrated in FIG. 1, there maybe more than one communication networks that may or may not shareresources to optimally deliver programming content and othernon-programming content to various connected client devices.

In operation, a request (i.e. a user request or a client request) may betransmitted by the client device 138 to the video service provider 110,via the communication network 154. The request may be made for aninstructed delivery and playback of programming media content at theclient device 138. In some embodiments, along with the transmittedrequest, the client device 138 may also transmit request information,such as user information, constraint information (e.g. deviceconstraints, network constraints, user-specified constraints, etc.),stored/collected user preferences (favorite shows, preferredproducts/services, frequently/recently searched terms, etc.), userattributes (e.g., demographic details, such as age, gender, incomebracket, and location), and digital footprints. Examples of digitalfootprints includes, but are not limited to traceable, inferred digitalactivities, analyzed digital activities, actions, contributions, andcommunications on internet or on digital devices, such as the clientdevice 138 or the secondary device 140. The request along with therequest information may be stored in a user database of the user manager112 present in a network of the video service provider 110.

In some embodiments, the client device 138 may be communicativelycoupled to a proxy server (not shown in FIG. 1) that acts as anintermediary between the client device 138 and at least one of thecontent producer/primary distributor 102, the video service provider110, the interactive service provider 120, the interactive serviceprovider partners 148, or the content/watermark/trigger recognizer 144.All the requests from the client device 138 may be parsed, processed,and/or routed by the proxy server to at least the contentproducer/primary distributor 102, the video service provider 110, theinteractive service provider 120, the interactive service providerpartners 148, and the content/watermark/trigger recognizer 144.

The content producer/primary distributor 102 may be configured totransmit media content (programming and/or non-programming, for e.g., TVshows, live shows, movies, advertisement videos, graphics (e.g., bugs,logos, banners, tickers, popups, ⅓^(rd) graphics, etc.), promotionalvideos, promotional audio, etc.) to the video service provider 110. Thenon-programming media content may include promotional media content forat least one of a product offering or a service offering. Thepromotional media content may include at least one of graphical content,textual content, video content, or animated content. The non-programmingmedia content may be transmitted by the content producer/primarydistributor 102 in an attempt to have a user engagement with thenon-programming media content that may be played along with programmingmedia content at the client device 138. Examples of the programmingmedia content may include, but are not limited to episodes of differentshows, movies, live matches, trailers, program recap, programs to beaired, program highlights, sports highlight, sports analysis, news feed,or other television programming, and on-demand shows (e.g. pre-producedor live shows, etc.), television ads, and traditional videoadvertisements.

The programming content store 114 may be configured to receive the mediacontent (programming and/or non-programming) from the pre-producedmaster content store 108 of the content producer/primary distributor102. The received media content (programming and/or non-programming) maybe stored along with the media assets (e.g., individual episodes,segments, or curated portions of shows, live content, or pre-packagedcontent) at the programming content store 114. In some embodiments, thevideo service provider 110 may be configured to transmit the receivedmedia content (programming and/or non-programming) to the interactiveservice provider 120. Additionally, a request may be transmitted by thevideo service provider 110 to the interactive service provider 120, foranalysis of the media content (programming and/or non-programming) andinsertion of trigger identifiers with the media content(programmingand/or non-programming). When detected, such trigger identifiers mayindicate event opportunities in a playback duration of the media contentthat may be served by the video service provider 110 at the clientdevice 138.

The event opportunities may correspond to candidate time intervals,which may be pre-specified or identified based on an analysis of theprogramming media content. Such event opportunity may correspond to aspecified event within the media content and such specified event maycorrespond to, but not limited to start credits, end credits, lullpoints, time-based marked regions, location-based marked regions,context-based marked regions, explicit scene start and end, and productplacement opportunities, within the programming media content.

The interactive service provider 120 may be configured to receive themedia content (programming and/or non-programming) along with a requestfor media analysis and insertion of trigger identifiers at differentevent opportunities in the media content. Initially, the media analysissystem 126 may be configured to analyze the media content (programmingand/or non-programing) received from the video service provider 110 toidentify slots, contextual points, product placement opportunities,etc., in the media content. More specifically, the media analysis system126 may be configured to analyze the programming media content (i.e., TVshows, movies, etc.) to look for non-programming content, such as brandlogos, but may also look for more contextually relevant opportunities,such as a coffee scene or a breakfast scene, or a restraint scene. Suchcontextually relevant opportunities may translate to ad or offerplacement opportunities that may be relevant to an advertiser and anon-programming media element can be overlaid at the contextuallyrelevant opportunity, e.g., a burger order logo (user-selectable) over abreakfast scene. The media analysis system 126 may also analyzeadvertisements so as to make such advertisements interactive at the timeof presentation on the client device 138.

In some embodiments, the media analysis system 126 may implement acontent recognition service (not shown in FIG. 1) that may be configuredto execute operations associated with content recognition on the mediacontent received from different content producers/content distributors(e.g., the content producer/primary distributor 102). The contentrecognition service may operate as a statistical signal processingengine that may execute operations for the recognition of programmingand/or non-programming assets (e.g., contextually relevant segments inprograms, programming videos, video advertisements, etc.) in the mediacontent. The content recognition service may further identify differentattributes (e.g., audio or video fingerprints) of the programming and/ornon-programming media assets (as part of media content) throughimplementation of a technique, such as acoustic fingerprinting, digitalwatermarking, and digital video fingerprinting. In certainimplementations, supervised or unsupervised machine learning, deeplearning, neural network, or artificially intelligence based techniquesmay be implemented to facilitate precise determination of differentattributes of the media items in the media content. In certainembodiments, the content recognition service may be implemented on aspecialized circuitry or an application at the trigger management system122. For such implementations, the content recognition service mayutilize computational resources of a graphical processing unit (GPU) atthe trigger management system 122.

After the analysis, different event opportunity points that arespecified by the video service provider 110 may be identified along withmetadata and other details of different programming and/ornon-programming assets (as part of the media content) by the mediaanalysis system 126. Examples of the different features may include, butnot limited to contextual information of different scenes/portions of aprogramming and/or a non-programming asset, a point in time to triggeran overlay in a programming and/or a non-programming asset, a suitableposition in a programming and/or a non-programming asset where anoverlay may be placed, or other playback information (e.g., playbackduration, start/end time stamps, etc.). The contextual information maybe a context of a scene, for example, sports, health, baby products, carpromo, and the like. A record of the analysis may be updated to thetrigger management system 122. The trigger management system 122 maystore the record generated post the analysis of the media content.

In some embodiments, the received request from the video serviceprovider 110 may also include information that specifies a type ofdetection that should be used at the client device 138. The type ofdetection may include, but not limited to, watermark detection,fingerprint detection, trigger identifier detection, tags-baseddetection, or a combination thereof. Based on the type of detection, theinteractive service provider 120 may insert fingerprint, watermark,trigger identifiers, or associated tags with the programming mediacontent. In other embodiments, in absence of such information with thereceived request, the interactive service provider 120 may be configuredto optimally select a type of detection suitable for execution at theclient device 138.

In accordance with an embodiment, the fingerprinting system 130 may beconfigured to fingerprint the programming media content and thenon-programming media content and generate fingerprint information forthe programming media content and the non-programming media content. Thefingerprint information may include at least one of acoustic fingerprintinformation and video fingerprint information for different segments (orselected audio or image frames) of the media content. The generatedfingerprint information for an asset (programming and/ornon-programming) may be utilized to uniquely identify and detectplayback of the asset at different client devices (e.g., the clientdevice 138).

As an example, the acoustic fingerprint information may include, but notlimited to, an average zero crossing rate, an estimated tempo, anaverage spectrum, a spectral flatness, and prominent tones across a setof frequency bands, a bandwidth, and a spectrogram for a segment of theaudio portion. Similarly, the video fingerprint information may include,but not limited to, histogram data, intensity information, motionvectors, feature points extracted based on application of at least oneof Scale Invariant Feature Transform (SIFT), Speeded Up Robust Features(SURF), Oriented Brief and Fast (ORB), and Histogram of OrientedGradients (HOG), and the like.

In accordance with another embodiment, the watermarking system 132 maybe configured to insert digital watermarks (visually or auditoryperceptible or imperceptible) into an audio portion and/or a videoportion of the media content (programming and/or non-programming) atanalyzed event opportunities in the media content. The watermarkingsystem 132 may be configured to generate watermark information, whichmay include metadata for at least one of an acoustic watermark or avideo watermark inserted between different segments (or selected audioor image frames) of the media content. The generated watermarkinformation for an asset in the media content may be utilized touniquely identify and detect playback of the asset at the client device138. The watermark information may further include details (e.g., title,duration, start time, end time, tags, etc.) of an asset in the mediacontent.

In some cases, the watermarks and/or fingerprints may be primarilyintended for the programming media content to identify events fortriggers. The non-programming assets may be typical fingerprinted orwatermarked to detect playback for tracking or in the case thenon-programming content is overlaid in the broadcast distribution tomake it interactive on the enabled client devices (e.g., the clientdevice 138). As an example, in a TV show with a beach scene,fingerprints, watermarks, or trigger identifiers, may be generated for apositive sentiment beach scene. Such fingerprints, watermarks, ortrigger identifiers may indicate an event opportunity for which apersonalized travel or vacation offer may be served on the client device138, for viewers that may like going to the beach. Thewatermark/fingerprint/trigger registry 136 may also store informationaround non-programming content, such as a promotional overlay for a newTV show, to enable the interactive service provider 120 to keep a trackof users who may have viewed the promotional overlay for the new TVshow. The interactive service provider 120 may also allow the user(through the client device 138 or other input devices) to act upon thepromotional overlay, such as scheduling a DVR recording or adding it tothe watch list.

In certain embodiments, the watermarking system 132 may also analyze andinsert digital watermarks in the programming media content to enableuser engagement (in response to user inputs) at the client device 138(or other input devices) with different product placements during theplayback of a program, e.g., a TV show at the client device 138. Also,such digital watermarks may also be used to garner revenue fromcontextual advertising, i.e., a “food brand” bug (i.e., an on-screengraphic) may be presented during a breakfast scene in a program playedat the client device 138, where coffee is consumed in the breakfastscene. Also, such digital watermarks may be utilized to insert a logo ona coffee cup itself to enable real-time product placement to a highestbidder for that customer.

The trigger management system 122 may be further configured to store thegenerated fingerprint information and the watermark information in thewatermark/fingerprint/trigger registry 136. After insertion of at leastone of fingerprints, watermarks, or tags with the media content, theinteractive service provider 120 may be configured to transmit the mediacontent (programming and/or non-programming) inserted with at least oneof watermarks or fingerprints back to the video service provider 110.The programming content store 114 at the video service provider 110 maybe configured to receive and store the media content inserted with atleast one of watermarks or fingerprints. Further, the programmingcontent store 114 may be further configured to transfer the mediacontent inserted with at least one of watermarks or fingerprints to themedia encoder/packager 116.

In some embodiments, when the media content is prepared for delivery tothe client device by the media encoder/packager 116, the video serviceprovider 110 may be configured to transmit a request to the interactiveservice provider 120. Such request is transmitted to insert a pluralityof trigger identifiers (e.g., Society of Cable and Telecom Engineers(SCTE-35) triggers, SCTE-104 triggers, .m3u8 playlists, .mpd manifesttags, Nielsen ID3 tags, inaudible beacons, data beacons, etc.) atdifferent event opportunities in the media content. The request may betransmitted via a set of calls (e.g., API calls) initialized by thetrigger insert SDK 118 that is present with the video service provider110.

In response to the set of calls via the trigger insert SDK 118, thevideo service provider 110 may create API-based integration with theinteractive service provider 120. The API-based integration mayfacilitate the interactive service provider 120 to insert/embed theplurality of trigger identifiers at different event opportunities in themedia content. A trigger scheduler (not shown in FIG. 1) in the triggermanagement system 122 may be configured to receive the request andgenerate a plurality of trigger identifiers (e.g., a trigger identifiermay include “255 bytes” of information that includes at least an assetor event identifier (ID)). The size of a trigger identifier may be lessthan or more than 255 bytes, without a deviation from scope of thedisclosure. The trigger scheduler may also generate event informationfor each generated trigger identifier. The event information mayinclude, but not limited to, asset ID of a media asset, an overlayposition, a graphic asset that may be overlaid, an overlay duration,and/or other presentation attributes (e.g., on-screen position,transparency, etc.).

The trigger scheduler may be further configured to transmit thegenerated plurality of trigger identifiers in response to the receivedrequest from the video service provider 110. The trigger scheduler maybe further configured to insert the plurality of trigger identifiers ata plurality of event opportunities in media content prepared by thevideo service provider 110. The video service provider 110 may bedifferent from the interactive service provider 120. The insertion maybe executed by the interactive service provider 120 via a firstintegration with the video service provider 110. The first integrationmay correspond to an API-based integration, of the interactive serviceprovider 120 with the video service provider 110, based on the triggerinsert SDK 118. In some cases, the prepared media content may include aclient manifest, which is transmitted along with a media stream (orseparately) of the prepared media content. The client manifest may beutilized to call for stream segments and/or may include the plurality oftrigger identifiers along with the trigger metadata (e.g. triggerduration, trigger end time, trigger start time, etc.). The triggerscheduler may be further configured to store the generated eventinformation associated with each trigger identifier in thewatermark/fingerprint/trigger registry 136.

In certain embodiments, the trigger management system 122 may include atagging system (not shown in FIG. 1). The tagging system may also placeone or more tags with the non-programming media content. Such tags maybe invoked at the client device 138 while non-programming media contentis played at the client device 138. Each tag may include, but is notlimited to, at least a pointer for a defined playback duration in acandidate time interval (e.g., “00:33:43”-“00:35:43” or “2 minutes” from“00:33:43”). The one or more tags may be transmitted along with a clientmanifest that may be created by the media encoder/packager 116, inconjunction with inputs from the trigger insert SDK 118. The clientmanifest may be interpreted at the client device 138 and the one or moretags present along with the client manifest may be represented by atleast one of a signal, a beacon, a SCTE-35 message, a marker, textualinformation, binary information, visual information, aural information,and the like.

In certain embodiments, the video service provider 110 may be configuredto modify one or more of the plurality of trigger identifiers insertedin the prepared media content at the media encoder/packager 116. Forexample, the modification may include, but not limited to, resetting theposition of the trigger identifier in the prepared media content, adeletion of a trigger identifier previously inserted in the preparedmedia content, a change in metadata of the trigger identifier (e.g.,trigger ID), and the like. In other embodiments, instead ofmodification, the video service provider 110 may be configured to add anew trigger identifier at an event opportunity in the prepared mediacontent. The modification of one or more of the plurality of triggeridentifiers or addition of a new trigger identifier may be done based onresources and services of the trigger insert SDK 118. The video serviceprovider 110 may be configured to generate, using the trigger insert SDK118, event information for each modified trigger identifier or each newtrigger identifier added in the prepared media content. After everymodification or update, the video service provider 110 communicates thegenerated event information and/or an event log (which includes a listof modifications) for an update at the watermark/fingerprint/triggerregistry 136. The watermark/fingerprint/trigger registry 136 may includedatabases that may be sorted, curated, updated, and structured such thatin an event of a call from the client device 138, a search may beexecuted in such databases to extract information (i.e., watermarkinformation, fingerprint information, or event information) thatcorresponds to a detected watermark, fingerprint, or trigger identifierat the client device 138.

In accordance with an embodiment, the media encoder/packager 116 at thevideo service provider 110 may be further configured to transmit a mediastream of the prepared media content to the client device 138, via thecommunication network 154. The media stream may be generated byencapsulation of the prepared media content (inserted with triggeridentifiers) in a transport stream based on a digital streaming protocolor a real time streaming protocol (RTSP)). Examples of the digitalstreaming protocol may include, but are not limited to, HTTP LiveStreaming (HLS), Adaptive Bitrate Streaming (ABR), MPEG-DASH, SmoothStreaming, and HTTP Dynamic Streaming (HDS).

At the client-end, the client device 138 may be configured to receivethe media stream of the prepared media content, via the communicationnetwork 154. In some embodiments, instead of the client device 138, aproxy server may receive the media stream of the prepared media contentand further transmit the same media stream to the client device 138, viathe communication network 154. Upon reception, the client device 138 mayexecute playback operations (e.g., decoding media content, renderinguser interface, loading non-graphical and graphical elements,initializing player, displaying decoded media content onto a playerwindow, etc.) for the programming media content onto the display 142associated with the client device 138.

The plurality of trigger identifiers may be present with the mediacontent (e.g., in the client manifest or embedded in the media streamthat includes the programming and/or the non-programming media content)that is served at the client device 138. In some cases, digitalwatermarks (audio or video watermarks generated by the watermarkingsystem 132) may be present at different candidate points (or eventopportunities in the media content. Also, in other cases, digitalfingerprints (audio or video fingerprints generated by thefingerprinting system 130) may be present with the media content.Therefore, the client device 138 may be configured to detect at leastone of the inserted digital watermarks, the digital fingerprints (ofdifferent segments or scenes, which may render an event opportunity),and the plurality of trigger identifiers present with the media content.The detection may be instructed by the interactive service provider 120via a second integration with the client device 138. The secondintegration may correspond to an API-based integration of theinteractive service provider 120 with the client device 138 based on thetrigger client SDK 146.

In some cases, the client application operational on the client device138 and responsible for playback of the media content may lack thecapability to detect and recognize at least one of the inserted digitalwatermarks, the digital fingerprints, and the plurality of triggeridentifiers in the media content. Therefore, in such cases, the clientdevice 138 may be configured to request the content/watermark/triggerrecognizer 144 to detect and recognize at least one of the inserteddigital watermarks, the digital fingerprints, and the plurality oftrigger identifiers in the media content, while the client applicationis playing the media content onto the display 142.

The content/watermark/trigger recognizer 144 may detect at least one ofthe inserted digital watermarks, the digital fingerprints, and theplurality of trigger identifiers in the media content. For the detecteddigital watermarks, digital fingerprints, or trigger identifiers, thecontent/watermark/trigger recognizer 144 may utilize the functionalitiesof the trigger client SDK 146 to identify information (e.g., watermarkinformation, fingerprint information, event information, etc.)associated with the detected digital watermarks, digital fingerprints,or trigger identifiers. Such information may include details of what todo next, when the detection occurs, and where to request for informationon the next action. For example, detection of a trigger identifier maylead to determination of an event ID (e.g., 255 bytes ID).

The event ID may be insufficient for the client device 138 to execute anaction, such as contacting the overlay ad server 150, retrieving theoverlay graphics, presenting the retrieved overlay graphics, andactivating the trigger inputs (input devices or overlay trigger inputs).Therefore, the content/watermark/trigger recognizer 144 may beconfigured to transmit a request to watermark/fingerprint/triggerregistry 136 to determine at least one of fingerprint information,watermark information, or event information for detected digitalfingerprints, detected digital watermarks, or one or more detectedtrigger identifiers, respectively. In some cases, the request may betransmitted through instruction calls raised by the trigger client SDK146 on behalf of the client device 138.

The watermark/fingerprint/trigger registry 136 may be configured toreceive the request from the content/watermark/trigger recognizer 144,or the trigger client SDK 146. In response to the received request, thewatermark/fingerprint/trigger registry 136 may be configured to searchfor at least one of fingerprint information, watermark information, orevent information for detected digital fingerprints, detected digitalwatermarks, or one or more detected trigger identifiers, respectively.The watermark/fingerprint/trigger registry 136 may be further configuredto communicate the fingerprint information, the watermark information,or the event information to the client device 138, for detected digitalfingerprints, detected digital watermarks, or one or more detectedtrigger identifiers, respectively.

The client device 138 may be configured to receive the communicatedfingerprint information, the watermark information, or the eventinformation from the watermark/fingerprint/trigger registry 136, via thetrigger client SDK 146. The fingerprint information, the watermarkinformation, or the event information may include instructions (or othermetadata) that specify at least an address (e.g., IP address) of theoverlay ad server 150, an asset identifier (ID) of an overlay graphicthat has to be presented at the event opportunity, permissioninformation (e.g., information to access data from the overlay ad server150), and the like. For each event opportunity, the client device 138may seek permissions to present an overlay graphic onto the display 142,from at least one of the video service provider 110 and the graphicoverlay manager 134. In response to permission requests, the clientdevice 138 may receive decisions regarding whether to present an overlaygraphic onto the display 142 for a detected event opportunity (specifiedby a watermark, a fingerprint, or a trigger identifier). The clientdevice 138 may receive such decisions (or decisioning information) fromthe video service provider 110 or the interactive service provider(e.g., the graphic overlay manager 134 at the interactive serviceprovider 120).

In some embodiments, the decision regarding whether to act upon theevent opportunity may be taken based on a defined criteria. The definedcriteria may include, but not limited to a retrieved set ofuser-preferences, a defined goal in a flight (defined by the videoservice provider 110), a measured or known user attention span, timeconstraints, inventory constraints, user-defined constraints, or acombination thereof. For example, the defined goal may include, but isnot limited to a defined value of cost per thousand impressions (CPM), aclick through rate (CTR), cost per sale (CPS), and a reach target.Similarly, the time constraints may include, but are not limited to alimited duration for which the user may stay online and stream theprogramming media content, a limit to a total number of candidate timeintervals (event opportunities) in the programming media content, and anaverage attention span of the user.

The limit on user availability for the duration of programming mediacontent may also set a limit on the number of candidate time intervalsin a limited duration of the programming media content. The inventoryconstraints may include, but are not limited to an availability ofoverlay graphics that can be presented in the identified candidate timeinterval and a limit on a number of overlay graphics that matches up tothe parameters in the request information. The user-defined constraintsmay include but are not limited to, a constraint on number ofadvertisements, a cap on number of times a product advertised may berendered purchasable, a content filter for a specific viewer (forexample, filter content that is not intended for young kids andteenagers (below the age of 18 years).

The client device 138 may be further configured to transmit a request tothe overlay ad server 150, to transmit overlay graphics for the eventopportunities for which the permissions to present overlay graphics (anddeliver services) have been granted by the video service provider 110 orthe graphic overlay manager 134. The overlay ad server 150 may beconfigured to receive the request from the client device 138 and inresponse to the received request, the overlay ad server 150 may transmitoverlay graphics (along with graphic metadata) to the client device 138.More specifically, the overlay ad server 150 may receive asset ID (ortrigger ID and event ID) for each event opportunity for which permissionto overlay graphics and render different services has been granted. Theoverlay ad server 150 may be further configured to search for overlaygraphics that match the asset ID (or trigger ID and event ID). In somecases, the overlay ad server 150 may also specify, in the graphicmetadata, different types of services that can be delivered through eachoverlay graphic, via the client device 138.

On receipt of the overlay graphics, the client device 138 may beconfigured to render a plurality of overlay graphics on the mediacontent (programming and non-programming) for a plurality of candidatetime intervals (corresponding to a plurality of event opportunities)within the playback duration of the media content. The plurality ofoverlay graphics may be rendered as a transparent layer placed on top(or sandwiched between different graphical layers) of the media content.An overlay graphic rendered on the media content may act as a triggerinput, or more specifically, a user-selectable option when activated atthe client device 138. However, in some cases, initially, the overlaygraphic rendered on the media content may be presented in a deactivatedstate. The overlay graphic may be a specific bug (i.e., a digitalon-screen graphic) for a product that is promoted in a non-programmingasset played at the client device 138, a graphical button, a productmark, and the like.

In some embodiments, the overlay graphics may be rendered on aninteractive platform (or a viewable region of an application interface)of a client application in the secondary device 140. A notification maybe server onto the secondary device 140 and upon user selection of theserved notification, the client application may pop up on the displayscreen of the secondary device 140 and an overlay graphic (e.g., anon-screen user-selectable button) may be displayed in the clientapplication interface. The decision to present an overlay graphic on thesecondary device 140 may be taken by at least one of the interactiveservice provider 120, the video service provider 110, or the clientdevice 138, based on usage statistics of the secondary device 140 anduser accessibility to engage with the overlay graphic (if activated) onthe secondary device 140.

The client device 138 may be further configured to search (or identify)for available set of input devices (wired or wireless) that may bepresent in the vicinity of the client device 138 or the secondary device140 (which may be paired/unpaired) with the client device 138. Exampleof an input device may include, but not limited to, TV remote,smartphone, a haptic switch, a gesture sensor, a motion sensor, a smartspeaker, a paired (or connectable) microphone, or a specialized physicalinput device (e.g., a button that is common for every product, or aproduct-specific button). Thereafter, the client device 138 may befurther configured to pair up with one or more of the set of inputdevices available in vicinity of the client device 138 or the secondarydevice 140 paired with the client device 138. The client device 138 maypair up with one or more of the set of input devices, via a Bluetoothnetwork, a Wi-Fi network, an internet-based network, a wired localnetwork, a wireless ad hoc network, and the like.

In some cases, instead of pairing up, an input device (e.g., a smartspeaker) may listen (or recognize) to audio (or visual)markers/beacons/signals during the playback of the media content, andtherefore, may not require pairing connections with the client device138. Alternatively, some virtual voice assistants may only be looselypaired (selectively paired) with the client device 138 based on aproximity and an ability to hear the audio from the client device 138.It may be the interactive service provider (or a partner) that mayconfigure the client device 138 to detect triggers. The user may have toaccept or approve the configuration through the secondary device 140(e.g., a smartphone) that is different from the client device 138 (e.g.,a smart TV) receiving the prepared media content from the video serviceprovider 110. Alternatively stated, a voice assistant application on thesecondary device 140 (in vicinity of the client device 138) may alsoinstall an voice assistant service that may offer relevant offers fromthe interactive service provider 120 (or other partners) while the useris engaged in watching the media content played on the client device138. Also, in some cases, the input device may be loosely paired or evenunpaired with the client device 138, and therefore, the input device maylisten for a trigger identifier in the audio portion of the mediacontent automatically based on instructions from the interactive serviceprovider 120.

A check may be further executed at the client device 138 to decidewhether to activate one or more input devices (paired/unpaired with theclient device 138) from the set of input devices, the overlay graphicsrendered on the media content or the overlay graphics rendered at thesecondary client device 140. The decision for activation may be takenbased on the check executed on a defined criteria that is associatedwith stored records of previous service requests for event opportunitiesalready on which services were delivered in past. The defined criteriamay include at least one of a user-defined constraint, an inventoryconstraint, and a specified threshold count of trigger responses for aproduct offering or a service offering promoted by the non-programmingmedia content. In some cases, the decisions may be taken to minimize aneffect of disturbance or diversions in viewing experience for a user whomay wish to watch the programming media content without any diversionscaused by promotional content (or user selectable options).

The client device 138 may be further configured to activate, via thetrigger client SDK 146, at least one of one or more input devices invicinity of the client device 138 or the rendered plurality of overlaygraphics on the media content or at the secondary device 140. At leastone of the one or more input devices may be paired with the clientdevice 138 or the secondary device 140. In some cases, the triggerclient SDK 146 may also be utilized to activate the one or more inputdevices (paired/unpaired with the client device 138). The activation maybe done based on the decisions taken (which in turn may be based on thedefined criteria) by the interactive service provider 120. Theactivation may be followed by a notification (e.g., a notification lightthat blinks on the TV remote when the TV remote is activated to send atrigger response, a vibration pattern, a tone with a specific pattern,an on-screen visual notification, etc.) to alert the user associatedwith the client device 138 regarding activation of a paired/unpairedinput device, or a rendered overlay graphic. The overlay graphic that isrendered on the display 142 may specify types of services that the usercan avail if selected through the activated input device(s) or overlaygraphics. The types of services may include, but not limited to one of adirect payment, a direct transaction, or a direct notification for aproduct offering or a service offering, and a direct update of aselected product offering or a service offering in a shopping cart, apersonalized list generation, or sharing on a social platform.

In some cases, the client device 138 may be configured to receive one ormore trigger responses over an activated overlay graphic on the mediacontent (during playback duration of the media content), via anactivated input device or an overlay trigger input (also referred to asan activated overlay graphic). The one or more trigger responses maycorrespond to one or more user interactions that are received based onuse inputs. Example of user inputs include, but are not limited to atouch input, a gesture input, a haptic feedback input, or a voicecommand input. One or more notifications may be received on at least oneof the client device 138 or the secondary device 140 paired with theclient device 138, in response to the received one or more triggerresponses. Also, more notifications may be delivered when an overlaygraphic or an input device is not enabled based on the defined criteria,or when service is selected for delivery, and the like.

In response to a receipt of the one or more trigger responses, theclient device 138 may be further configured to transmit a request to thefulfillment server 152 to deliver services. Examples of the differentservices that may be delivered through the client device 138 or thesecondary device 140, may include, but are not limited to an OTT servicethrough internet, a service that can be delivered through broadcastnetworks on different client devices, such as ATSC smart TVs (e.g., ATSC3-enabled TV), presentable (visual, aural, etc.) services, consumableservices, online services, purchase service, or other access independentservices or dependent services. The fulfillment server 152 may beconfigured to receive the request from the client device 138 fordelivery of services (or service offerings) that were promoted bydifferent assets (e.g., video ads or graphical overlays) in the mediacontent at the client device 138. The fulfillment server 152 may deliverthe services in response to the received one or more trigger responseson the event opportunities in the programming media content.

The client device 138 may be further configured to display aninteractive platform (e.g., an interactive product display and paymentpage etc.) on the client device 138, to enable delivery of the servicesin response to the received one or more trigger responses. The deliveryof the services may be enabled at the client device 138 in communicationwith the fulfillment server 152 that fulfills the delivery of theservices to increase a user engagement to select services.

For example, the fulfillment server 152 may render a shopping cart witha product offering (promoted in the non-programming media content at anevent opportunity) selected based on a trigger response. In some cases,instead of a shopping cart, a selected product offering may be directlyordered by the fulfillment server 152 based on pre-specified paymentinformation and address information in the user database.

In an exemplary embodiment, a virtual voice assistant may be enabled onthe input device or at least one of the client device 138 and thesecondary device 140. Also, the virtual voice assistant may beaccessible through a voice command device (e.g., a smart speaker). Theinput device may be configured to actively wait and listen for a triggeridentifier in the audio portion of the media content. The triggeridentifier in the audio portion may be an inaudible beacon (ultrasonicor infrasonic frequency range), an audible beacon, and the like. Thevoice command device or the virtual assistant may detect beacons withinthe playback duration of the programming media content and facilitate aconversation with the user to receive a trigger response from the user.The trigger response may be received through a user's speech input. Thevoice command device or the virtual assistant may communicate thetrigger response to the client device 138, which may further communicatethe trigger response to the fulfillment server 152. Thereafter, thefulfillment sever 152 may be further configured to deliver services(offered with the playback of the media content) based on the receivedtrigger response in the conversation between the user and the voicecommand device or the virtual assistant.

In certain embodiments, all the operations executed by the client device138 except the playback of media content onto the display 142 may beexecuted by a proxy server (not shown in FIG. 1) on behalf of the clientdevice 138. Such execution may not affect the functional performance ofthe client device 138 or expected results from different operationsdescribed in FIG. 1, without a deviation from scope of the disclosure.

FIG. 2 is a block diagram that illustrates an exemplary triggermanagement system that handles insertion and activation of triggeridentifiers in media content received from different video serviceproviders, in accordance with an exemplary embodiment of the disclosure.FIG. 2 is explained in conjunction with elements from FIG. 1. Withreference to FIG. 2, there is shown a block diagram 200 of the triggermanagement system 122. The trigger management system 122 may include anintegration manager 204, a content recognizer 206, an event opportunitystore 208, and a trigger scheduler 210. The trigger management system122 may also include the fingerprinting system 130 and the watermarkingsystem 132, as shown in FIG. 1.

The integration manager 204 may comprise suitable logic, circuitry, andinterfaces that may be configured to authenticate API requests or SDKintegration requests from different video service providers (e.g., thevideo service provider 110) or different client devices (e.g., theclient device 138). Such authentication may be done based on sharinglicenses, certificates, agreements, or public/private key for access tovarious APIs, the trigger insert SDK 118, or the trigger client SDK 146.

The content recognizer 206 may comprise suitable logic, circuitry, andinterfaces that may be configured to execute operations associated withcontent recognition of programming or non-programming media contentreceived from different video service providers (e.g., the video serviceprovider 110). The content recognizer 206 may operate as a statisticalsignal processing engine that may execute operations for the recognitionof assets (e.g., segments that offer product placement opportunities(e.g., coffee scene), video advertisements, overlay graphics (such as,bugs, logos, ⅓^(rd) graphics, banners, tickers, etc.) in the mediacontent. The content recognizer 206 may further identify differentattributes (e.g., audio or video fingerprints) of the programming and/ornon-programming media assets (as part of non-programming media content)through implementation of a technique, such as acoustic fingerprinting,digital watermarking, and digital video fingerprinting.

In certain implementations, supervised or unsupervised machine learning,deep learning, neural network, or artificially intelligence basedtechniques may be implemented to facilitate precise determination ofdifferent attributes of the media items in the media content. In certainembodiments, the content recognizer 206 may be implemented as aspecialized circuitry or an application at the trigger management system122. For such implementations, the content recognizer 206 may utilizecomputational resources of a graphical processing unit (GPU) at thetrigger management system 122.

The event opportunity store 208 may comprise suitable logic, circuitry,and interfaces that may be configured to store a scheduled list of eventopportunities (represented by candidate time intervals or on-screenregions in candidate time intervals) in the media content (programmingor non-programming). The scheduled list of event opportunities may bereceived from the video service provider 110 or may be generated by thetrigger management system 122 based on an analysis of the media contentreceived from the video service provider 110.

The trigger scheduler 210 may comprise suitable logic, circuitry, andinterfaces that may be configured to receive a request from the videoservice provider 110 and generate a plurality of trigger identifiers(e.g., a trigger identifier may include “255 bytes” of information thatincludes at least an asset or event identifier (ID)). The triggerscheduler 210 may also generate event information for each generatedtrigger identifier. The event information may include, but is notlimited to asset ID of a media asset, an overlay position, a graphicasset that may be overlaid, an overlay duration, or other presentationattributes. The trigger scheduler 210 may be further configured totransmit the generated plurality of trigger identifiers in response tothe received request from the video service provider 110 and furtherinsert the plurality of trigger identifiers in the media content atdifferent event opportunities, through the API-based integration withthe video service provider 110 via the trigger insert SDK 118. Thetrigger scheduler 210 may be further configured to store the generatedevent information associated with each trigger identifier in thewatermark/fingerprint/trigger registry 136.

FIGS. 3A and 3B, collectively depict a first exemplary scenario thatillustrates a client-side presentation of overlay graphics and receiptof trigger response over presented overlay graphics, via an inputdevice, in accordance with an embodiment of the disclosure. FIGS. 3A and3B are explained in conjunction with elements from FIG. 1 and FIG. 2.With reference to FIGS. 3A and 3B, there is shown a first exemplaryscenario that shows two different states at different time instants(i.e., a first state 300A and a second state 300B) in which a purchaseservice is rendered based on user-options activated while programmingmedia content is received and played onto the display 142.

In the first state 300A of FIG. 3A, a user 302 may be engaged inwatching programming media content (e.g., a favorite TV show) on atelevision channel (represented by a channel logo 304). An input device306 that is paired with the client device 138 may be held in the hand ofthe user 302. While an advertisement media 308 is played on the channel(represented by the channel logo 304), the client device 138 may presentan overlay graphic 310 (represented as a “buy option” in the FIG. 3A)and activate the input device 306 to purchase a product offering 312promoted in the advertisement media 308 presented onto the display 142.

In second state 300B of FIG. 3B that occurs after the first state 300Aof FIG. 3A, upon activation of the input device 306, a notificationlight 306A on the input device 306 may blink in accordance with aspecific pattern. Such blinking may alert the user 302 to act within aduration for which the input device 306 is activated. In an event wherethe button on the input device 306 is pressed (or selected), thefulfillment server 152 may receive a trigger response (which may includedetails of the selected product offering 312 and user details (e.g.,payment details, delivery address, etc.)). Upon receipt of the triggerresponse, the fulfillment server 152 may generate an interactive view(i.e., an interactive platform or page) that may be displayed onto thedisplay 142. The interactive view may show the selected product offering312 getting transferred to shopping cart 314 and a confirmation messagemay pop up on the display 142. Upon confirmation, the fulfillment server152 may be configured to complete a transaction and the selected productoffering 312 may be ordered for delivery.

FIG. 4 depicts a second exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of trigger response overpresented overlay graphics, via a trigger input rendered on a secondarydevice paired with client device, in accordance with an embodiment ofthe disclosure. FIG. 4 is explained in conjunction with elements fromFIGS. 1, 2, 3A, and 3B. With reference to FIG. 4, there is shown asecond exemplary scenario that shows a state 400 in which a purchaseservice is rendered based on user-options activated whilenon-programming media content is streamed and played on the display 142.

In the state 400, a user 402 may be engaged in watching programmingmedia content (e.g., favorite TV show) on a television channel(represented by a channel logo 404). A smartphone 406 (i.e., thesecondary device 140) that is paired with the client device 138 may beheld in the hand of the user 402. While the non-programming mediacontent, such as an advertisement media 408, is played on the channel(represented by the channel logo 404), an overlay graphic 410 (which isrepresented by a “graphical button”) is rendered onto the display of thesmartphone 406 and the smartphone 406 is activated to purchase a productoffering 412 promoted in the advertisement media 408 and presented onthe display 142.

Upon activation, the smartphone 406 may respond by generating anotification (e.g., a vibration, a light blink, screen blink, a tone,etc.). Such notifications may alert the user 402 to act within aspecified duration for which the smartphone 406 is activated. In anevent where the overlay graphic 410 on the smartphone 406 is pressed (orselected), the fulfillment server 152 may receive a trigger response(which may include details of the selected product offering 412 and userdetails (e.g., payment details, delivery address, etc.)). Upon receiptof the trigger response, the fulfillment server 152 may generate aninteractive view (i.e., an interactive platform or interactive page)that may be displayed onto the display 142. The interactive view mayshow the selected product offering 412 getting transferred to shoppingcart 414 and a confirmation message may pop up on the display 142. Uponconfirmation, the fulfillment server 152 may be configured to complete atransaction and the selected product offering 412 may be ordered fordelivery.

FIG. 5 depicts a third exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of trigger response overpresented overlay graphics based on a direct user selection of presentedoverlay graphics, in accordance with an embodiment of the disclosure.FIG. 5 is explained in conjunction with elements from FIGS. 1, 2, 3A,3B, and 4. With reference to FIG. 5, there is shown a third exemplaryscenario that shows a state 500 in which a purchase service is renderedbased on user-options activated while non-programming media content isstreamed and played on a touch screen device.

In the state 500, a user 502 may be engaged in watching programmingmedia content (e.g., favorite TV show) on a television channel(represented by a channel logo 504). A tablet 506 (i.e., an integrationof the client device 138 and the display 142) may be held in the hand ofthe user 402. While an advertisement media 508 is played on the channel(represented by the channel logo 504) at a certain point of theprogramming media content, an overlay graphic 510 (which is representedby a “graphical button”) may be rendered onto the touch display of thetablet 506. Thereafter, the tablet 506 may be activated to purchase aproduct offering 512 promoted in the advertisement media 508.

Upon activation, the tablet 506 may respond by generating a notification(e.g., a vibration, a light blink, screen blink, a tone, etc.). Suchnotifications may alert the user 502 to act within a specified durationfor which the tablet 506 is activated. In an event where the overlaygraphic 510 is selected by a touch input on the touch screen of thetablet 506, the fulfillment server 152 may receive a trigger response(which may include details of the selected product offering 512 and userdetails (e.g., payment details, delivery address, etc.)). Upon receiptof the trigger response, the fulfillment server 152 may generate aninteractive view (i.e., an interactive platform, page, or UI) that maybe displayed onto the tablet 506. The page may show the selected productoffering 512 getting transferred to shopping cart 514 and a confirmationmessage may pop up on the tablet 506. Upon confirmation, the fulfillmentserver 152 may be configured to complete a transaction and the selectedproduct offering 512 may be ordered for delivery.

FIG. 6 depicts a fourth exemplary scenario that illustrates client-sidepresentation of overlay graphics and receipt of voice responses overpresented overlay graphics, handled by a smart conversational agent, inaccordance with an embodiment of the disclosure. FIG. 6 is explained inconjunction with elements from FIGS. 1, 2, 3A, 3B, 4, and 5. Withreference to FIG. 6, there is shown a fourth exemplary scenario 600 inwhich a purchase service is rendered based on activation of a smartconversational agent while programming media content is streamed andplayed onto the display 142.

Initially, a user 602 may be engaged in watching programming mediacontent (e.g., favorite TV show) on a television channel (represented bya channel logo 604). A smart speaker 606 that includes (or is accessibleto) a virtual voice assistant may be paired with the client device 138.The smart speaker 606 may be configured to actively wait and listen fora trigger identifier (e.g., an inaudible beacon) in the audio portion ofthe programming media content. Thereafter, at a certain time point basedon the trigger identifier, an advertisement media 608 for a productoffering 610 is played on the channel (represented by the channel logo604). The inaudible beacon (i.e., a trigger identifier) present in theaudio portion of the advertisement media 608 may be received by thesmart speaker 606, via a set of microphones in the smart speaker 606,during playback of the audio portion of the advertisement media 608.Upon receipt of the inaudible beacon, the virtual voice assistant in thesmart speaker 606 may detect the inaudible beacon during the playbackduration of the advertisement media 608 and facilitate a conversationwith the user 602 to receive a trigger response from the user 602.

The virtual conversational agent may start the conversation bydescribing about the product offering 610 presented onto the display142. Such conversation may alert the user 602 to act within a durationfor which the smart speaker 606 is activated. In an event where a voiceresponse is said by the user 602 to purchase the product offering 610 inthe conversation, the virtual conversational agent may confirm an orderfor the product offering 610 and the client device 138 may display ashopping cart 612 where the product offering 610 is added. The clientdevice 138 may further transfer the voice response from the user 602 asa trigger response to the fulfillment server 152. The fulfillment server152 may receive a trigger response (which may include details of theselected product offering 610 and user details (e.g., payment details,delivery address, etc.)). Upon receipt of the trigger response, thefulfillment server 152 may be configured to complete a transaction andthe selected product offering 610 may be ordered for delivery.

FIGS. 7A to 7E are flowcharts that collectively illustrate a method fordelivery of different services through different client devices, inaccordance with an embodiment of the disclosure. FIGS. 7A to 7E areexplained in conjunction with elements from FIGS. 1, 2, 3A, 3B, 4, 5,and 6. With reference to FIGS. 7A to 7E, there is shown a flowchart. Theflowchart may depict a method that starts from 702 and proceeds to 704.

At 704, programming and/or non-programming media content may betransmitted to the video service provider 110. The contentproducer/primary distributor 102 may be configured to transmit theprogramming and/or non-programming media content to the video serviceprovider 110.

At 706, programming and/or non-programming media content may be receivedfrom the content producer/primary distributor 102. The video serviceprovider 110 may be configured to receive the programming and/ornon-programming media content from the content producer/primarydistributor 102.

At 708, received programming and/or non-programming media content may betransmitted along with a request for media analysis and triggeridentifier insertion in programming and/or non-programming media contentto the interactive service provider 120. The video service provider 110may be configured to transmit the received programming and/ornon-programming media content along with a request for media analysisand trigger identifier insertion in the programming and/ornon-programming media content to the interactive service provider 120.

At 710, programming and/or non-programming media content along with therequest for media analysis and trigger identifier insertion in theprogramming and/or non-programming media content may be received. Theinteractive service provider 120 may be configured to receive theprogramming and/or non-programming media content along with the requestfor media analysis and trigger identifier insertion in the programmingand/or non-programming media content.

At 712, media analysis may be executed on received programming and/ornon-programming media content in response to received request from thevideo service provider 110. The media analysis system 126 (of theinteractive service provider 120) may be configured to execute a mediaanalysis on received programming and/or non-programming media content inresponse to received request from the video service provider 110.

At 714, fingerprint information may be generated for the receivedprogramming and/or non-programming media content. The fingerprintingsystem 130 (of the interactive service provider 120) may be configuredto generate fingerprint information for the received programming and/ornon-programming media content.

At 716, watermarks may be inserted in the received programming and/ornon-programming media content. The watermarking system 132 may beconfigured to insert the watermarks in the received programming and/ornon-programming media content.

At 718, generated fingerprint information and watermark information maybe stored in the watermark/fingerprint/trigger registry 136. The triggermanagement system 122 (of the interactive service provider 120) may beconfigured to store the generated fingerprint information and watermarkinformation in the watermark/fingerprint/trigger registry 136.

At 720, programming and/or non-programming media content that isinserted with watermarks is transmitted along with fingerprintinformation to the video service provider 110. The interactive serviceprovider 120 may be configured to transmit the programming and/ornon-programming media content that is inserted with watermarks alongwith fingerprint information to the video service provider 110.

At 722, programming and/or non-programming media content that isinserted with watermarks may be received along with fingerprintinformation from the video service provider 110. The video serviceprovider 110 may be configured to receive the programming and/ornon-programming media content that is inserted with watermarks alongwith the fingerprint information from the video service provider 110.

At 724, received programming and/or non-programming media content thatis inserted with watermarks along with fingerprint information may bestored in the programming content store 114. The video service provider110 may be configured to store the received programming and/ornon-programming media content that is inserted with watermarks alongwith fingerprint information in the programming content store 114.

At 726, a request may be transmitted to the interactive service provider120 to insert trigger identifiers in prepared media content at eventopportunities, based on communications via the trigger insert SDK 118.The video service provider 110 may be configured to transmit a requestto the interactive service provider 120 to insert trigger identifiers inprepared media content at event opportunities, based on communicationsvia the trigger insert SDK 118.

At 728, the request to insert trigger identifier in the prepared mediacontent at different event opportunities may be received. Theinteractive service provider 120 may be configured to receive therequest to insert trigger identifier in the prepared media content atdifferent event opportunities.

At 730, trigger identifiers for the received programming and/ornon-programming media content may be transmitted to the video serviceprovider 110. The interactive service provider 120 may be configured totransmit the trigger identifiers for the received programming and/ornon-programming media content to the video service provider 110.

At 732, trigger identifiers may be inserted in the prepared mediacontent at different event opportunities through integrations with thevideo service provider 110, via the trigger insert SDK 118. The triggerscheduler 210 (of the trigger management system 122) may be configuredto insert the trigger identifiers in the prepared media content atdifferent event opportunities through integrations with the videoservice provider 110, via the trigger insert SDK 118.

At 734, trigger identifiers may be added or modified in the preparedmedia content at event opportunities through the trigger insert SDK 118.The video service provider 110 may be configured to add or modifytrigger identifiers in the prepared media content at event opportunitiesthrough the trigger insert SDK 118.

At 736, event information for added or modified trigger identifiers maybe communicated to the interactive service provider 120. The videoservice provider 110 may be configured to communicate the eventinformation for added or modified trigger identifiers may becommunicated to the interactive service provider 120.

At 738, the event information associated with each trigger identifiermay be stored in the watermark/fingerprint/trigger registry 136. Thetrigger management system 122 may be configured to store the eventinformation associated with each trigger identifier in thewatermark/fingerprint/trigger registry 136.

At 740, a media stream that includes the prepared media content insertedwith the trigger identifiers may be transmitted to the client device138. The video service provider 110 may be configured to transmit themedia stream that includes the prepared media content inserted with thetrigger identifiers to the client device 138.

At 742, the media stream that includes the prepared media contentinserted with the trigger identifiers may be received. The client device138 may be configured to receive the media stream that includes theprepared media content inserted with the trigger identifiers from thevideo service provider 110.

At 744, inserted trigger identifiers may be detected in the playbackduration of the programing media content at the client device 138,through the trigger client SDK 146. The client device 138 may beconfigured to request the content/watermark/trigger recognizer 144 todetect inserted trigger identifiers in the playback duration of theprograming media content at the client device 138, through the triggerclient SDK 146.

At 746, a request may be transmitted to thewatermark/fingerprint/trigger registry 136 to determine eventinformation for one or more detected trigger identifiers. The clientdevice 138 may request the content/watermark/trigger recognizer 144 torequest the watermark/fingerprint/trigger registry 136 to determineevent information for one or more detected trigger identifiers.

At 748, the request to determine event information for one or moredetected trigger identifiers may be received. The interactive serviceprovider 120 may be configured to receive the request to determine eventinformation for one or more detected trigger identifiers.

At 750, event information for one or more detected trigger identifiersmay be searched in the watermark/fingerprint/trigger registry 136. Thetrigger management system 122 may be configured to search on eventinformation for one or more detected trigger identifiers in thewatermark/fingerprint/trigger registry 136.

At 752, the event information may be communicated to the client device138 for one or more detected trigger identifiers. The trigger managementsystem 122 may be configured to communicate the event information to theclient device 138 for one or more detected trigger identifiers.

At 754, the communicated event information may be received for one ormore detected trigger identifiers. The client device 138 may beconfigured to receive the communicated event information for one or moredetected trigger identifiers.

At 756, it may be determined whether it is required to present overlaygraphics at event opportunities for which event information is received.The client device 138 may be configured to determine whether it isrequired to present overlay graphics at event opportunities for whichevent information is received. In a case where a requirement exists,control passes to 758. Otherwise, control passes to 744.

At 758, a request may be transmitted to the overlay ad server 150 totransmit overlay graphics for received event information on eventopportunities. The client device 138 may be configured to transmit therequest to the overlay ad server 150 to transmit overlay graphics forreceived event information on event opportunities.

At 760, the request to transmit overlay graphics for received eventinformation on event opportunities may be received. The overlay adserver 150 may be configured to receive the request to transmit overlaygraphics for received event information on event opportunities.

At 762, the overlay graphics may be transmitted along with graphicmetadata to the client device 138. The overlay ad server 150 may beconfigured to transmit the overlay graphics along with graphic metadatato the client device 138.

At 764, the overlay graphics may be rendered as user-selectable optionson at least the client device 138 or the secondary device 140 pairedwith the client device 138, at event opportunities in the programmingmedia content. The client device 138 may be configured to render theoverlay graphics as user-selectable options on at least the clientdevice 138 and the secondary device 140 paired with the client device138, at event opportunities in the programming media content.

At 766, an input device that is in vicinity of the client device 138 orthe secondary device 140 may be paired with the client device 138. Theclient device 138 may be configured to pair an input device that is invicinity of the client device 138 or the secondary device 140 with theclient device 138.

At 768, it may be determined whether it is required to activate theinput device or the rendered overlay graphic at event opportunities. Theclient device 138 may be configured to determine whether it is requiredto activate the input device or the rendered overlay graphic at eventopportunities. In a case where a requirement exists, control passes to770. Otherwise, control passes to 744.

At 770, the input device may be activated, via the trigger client SDK146, while the overlay graphics are displayed at event opportunities inthe playback duration of the programming media content on the clientdevice 138 or the secondary device 140 paired with the client device138. The client device 138 may be configured to activate the inputdevice, via the trigger client SDK 146, while the overlay graphics aredisplayed at event opportunities in the playback duration of theprogramming media content on the client device 138 or the secondarydevice 140 paired with the client device 138.

At 772, one or more trigger responses may be received for one or moreevent opportunities, via the activated input device or rendered overlaygraphics. The client device 138 may be configured to receive the one ormore trigger responses for one or more event opportunities, via theactivated input device or rendered overlay graphics.

At 774, a request may be transmitted to the fulfillment server 152 todeliver services based on received one or more trigger responses. Theclient device 138 may be configured to transmit the request to thefulfillment server 152 to deliver services based on received one or moretrigger responses.

At 776, the request may be received from the client device 138 fordelivery of services based on received one or more trigger responses.The fulfillment server 152 may be configured to receive the request fromthe client device 138 for delivery of services based on received one ormore trigger responses.

At 778, services may be delivered in response to the received one ormore trigger responses on event opportunities in the programming mediacontent. The fulfillment server 152 may be configured to deliverservices in response to the received one or more trigger responses onevent opportunities in the programming media content.

At 780, one or more notifications may be delivered on at least one ofthe client device 138 or the secondary device 140 paired with the clientdevice 138, in response to the delivery of services or received triggerresponses. The fulfillment server 152 may be configured to deliver oneor more notifications on at least one of the client device 138 or thesecondary device 140 paired with the client device 138, in response tothe delivery of services or received trigger responses. Control passesto end.

FIG. 8 is a conceptual diagram that illustrates an example of a hardwareimplementation for the system of FIG. 1 that employs a processing systemfor delivery of over-the-top (OTT) services through media contentstreaming devices, in accordance with an exemplary embodiment of thedisclosure. In FIG. 8, the hardware implementation is shown by arepresentation 800 for the trigger management system 122 that employs aprocessing system 802 for management and delivery of services on adistribution network, in response to one or more event triggers, inaccordance with an exemplary embodiment of the disclosure, as describedherein. In some examples, the processing system 802 may comprise one ormore hardware processors 804, a non-transitory computer readable medium806, the integration manager 204, the content recognizer 206, the eventopportunity store 208, the trigger scheduler 210, the fingerprintingsystem 130, and the watermarking system 132.

In this example, the trigger management system 122 that employs theprocessing system 802 may be implemented with bus architecture,represented generally by a bus 808. The bus 808 may include any numberof interconnecting buses and bridges depending on the specificimplementation of the trigger management system 122 and the overalldesign constraints. The bus 808 links together various circuitsincluding the one or more processors, represented generally by thehardware processor 804, the non-transitory computer-readable media,represented generally by the non-transitory computer readable medium806, the integration manager 204, the content recognizer 206, the eventopportunity store 208, the trigger scheduler 210, the fingerprintingsystem 130, and the watermarking system 132 which may be configured tocarry out one or more operations or methods described herein. A businterface 810 renders an interface between the bus 808 and a transceiver812. The transceiver 812 facilitates communication via the communicationnetwork 154 (FIG. 1) with various other apparatus, such as the videoservice provider 110, the client device 138, and the secondary device140.

The hardware processor 804 may be configured to manage the bus 808 andgeneral processing, including the execution of a set of instructionsstored on the non-transitory computer readable medium 806. The set ofinstructions, when executed by the hardware processor 804, causes thetrigger management system 122 to execute the various functions describedherein for any particular apparatus. The non-transitory computerreadable medium 806 may also be used for storing data that ismanipulated by the hardware processor 804 when executing the set ofinstructions. The non-transitory computer readable medium 806 may alsobe configured to store data for one or more of the integration manager204, the content recognizer 206, the event opportunity store 208, thetrigger scheduler 210, the fingerprinting system 130, and thewatermarking system 132.

In an aspect of the disclosure, the hardware processor 804, thenon-transitory computer readable medium 806, or a combination of bothmay be configured or otherwise specially programmed to execute theoperations or functionality of the integration manager 204, the contentrecognizer 206, the event opportunity store 208, the trigger scheduler210, the fingerprinting system 130, and the watermarking system 132, orvarious other components described herein. For example, the hardwareprocessor 804, the non-transitory computer readable medium 806, or acombination of both may be configured or otherwise specially programmedto perform the operations and functionality of the integration manager204, the content recognizer 206, the event opportunity store 208, thetrigger scheduler 210, the fingerprinting system 130, and thewatermarking system 132 as described with respect to FIGS. 1, 2, 3A, 3B,4, 5, 6, and 7A to 7E.

The “OTT services” are services that allows at least one of the videoservices provider 110, the interactive service provider 120, or theinteractive service provider partners 148 to provide audio, video, andother media services (for example, purchase, payment, social sharing, orreverse bidding on products advertised in advertisements) to a consumerover the internet via streaming media, bypassing telecommunications,cable or broadcast television service providers that traditionally actas a controller or distributor of such content. Further, an entity thatdelivers the OTT service may not own a distribution network throughwhich the OTT service is delivered. In some cases, an entity thatdelivers the OTT service may partially or completely own a distributionnetwork through which the OTT service is delivered. For example, anentity that delivers the OTT service may deliver the OTT service througha network owned by a different service provider.

As utilized herein the terms “circuits” and “circuitry” refer tophysical electronic components (i.e. hardware) and any software and/orfirmware (“code”) which may configure the hardware, be executed by thehardware, and/or otherwise be associated with the hardware. As usedherein, for example, a particular processor and memory may comprise afirst “circuit” when executing first one or more lines of code and maycomprise a second “circuit” when executing second one or more lines ofcode. As utilized herein, “and/or” means any one or more of the items inthe list joined by “and/or”. As an example, “x and/or y” means anyelement of the three-element set {(x), (y), (x, y)}. As another example,“x, y, and/or z” means any element of the seven-element set {(x), (y),(z), (x, y), (x, z), (y, z), (x, y, z)}. As utilized herein, the term“exemplary” means serving as a non-limiting example, instance, orillustration. As utilized herein, the terms “e.g. and for example” setoff lists of one or more non-limiting examples, instances, orillustrations. As utilized herein, circuitry is “operable” to perform afunction whenever the circuitry comprises the necessary hardware andcode (if any is necessary) to perform the function, regardless ofwhether performance of the function is disabled, or not enabled, by someuser-configurable setting.

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting of embodiments ofthe present disclosure. As used herein, the singular forms “a”, “an” and“the” are intended to include the plural forms as well, unless thecontext clearly indicates otherwise. It will be further understood thatthe terms “comprises”, “comprising,”, “includes” and/or “including”,when used herein, specify the presence of stated features, integers,steps, operations, elements, and/or components, but do not preclude thepresence or addition of one or more other features, integers, steps,operations, elements, components, and/or groups thereof.

Further, many embodiments are described in terms of sequences of actionsto be performed by, for example, elements of a computing device. It willbe recognized that various actions described herein can be performed byspecific circuits (e.g., application specific integrated circuits(ASICs)), by program instructions being executed by one or moreprocessors, or by a combination of both. Additionally, these sequence ofactions described herein can be considered to be embodied entirelywithin any non-transitory form of computer readable storage mediumhaving stored therein a corresponding set of computer instructions thatupon execution would cause an associated processor to perform thefunctionality described herein. Thus, the various aspects of the presentdisclosure may be embodied in a number of different forms, all of whichhave been contemplated to be within the scope of the claimed subjectmatter. In addition, for each of the embodiments described herein, thecorresponding form of any such embodiments may be described herein as,for example, “logic configured to” perform the described action.

Another embodiment of the present disclosure may provide anon-transitory machine and/or computer readable storage and/or media,having stored thereon, a machine code and/or a computer program havingat least one code section executable by a machine and/or a computer,thereby causing the machine and/or computer to perform the steps asdescribed herein to obtain a global workflow sequence.

The present disclosure may also be embedded in a computer programproduct, which comprises all the features enabling the implementation ofthe methods described herein, and which when loaded in a computer systemis able to carry out these methods. Computer program in the presentcontext means any expression, in any language, code or notation, of aset of instructions intended to cause a system having an informationprocessing capability to perform a particular function either directlyor after either or both of the following: a) conversion to anotherlanguage, code or notation; b) reproduction in a different materialform.

Further, those of skill in the art will appreciate that the variousillustrative logical blocks, modules, circuits, algorithm, and/or stepsdescribed in connection with the embodiments disclosed herein may beimplemented as electronic hardware, computer software, firmware, orcombinations thereof. To clearly illustrate this interchangeability ofhardware and software, various illustrative components, blocks, modules,circuits, and steps have been described above generally in terms oftheir functionality. Whether such functionality is implemented ashardware or software depends upon the particular application and designconstraints imposed on the overall system. Skilled artisans mayimplement the described functionality in varying ways for eachparticular application, but such implementation decisions should not beinterpreted as causing a departure from the scope of the presentdisclosure.

The methods, sequences and/or algorithms described in connection withthe embodiments disclosed herein may be embodied directly in firmware,hardware, in a software module executed by a processor, or in acombination thereof. A software module may reside in RAM memory, flashmemory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, aremovable disk, a CD-ROM, or any other form of storage medium known inthe art. An exemplary storage medium is coupled to the processor suchthat the processor can read information from, and write information to,the storage medium. In the alternative, the storage medium may beintegral to the processor.

While the present disclosure has been described with reference tocertain embodiments, it will be noted understood by, for example, thoseskilled in the art that various changes and modification could be madeand equivalents may be substituted without departing from the scope ofthe present disclosure as defined, for example, in the appended claims.In addition, many modifications may be made to adapt a particularsituation or material to the teachings of the present disclosure withoutdeparting from its scope. The functions, steps and/or actions of themethod claims in accordance with the embodiments of the presentdisclosure described herein need not be performed in any particularorder. Furthermore, although elements of the present disclosure may bedescribed or claimed in the singular, the plural is contemplated unlesslimitation to the singular is explicitly stated. Therefore, it isintended that the present disclosure not be limited to the particularembodiment disclosed, but that the present disclosure will include allembodiments falling within the scope of the appended claims.

What is claimed is:
 1. A system, comprising: one or more processors inan interactive service provider for delivery of at least one servicethrough at least one of a client device or a secondary device pairedwith the client device, wherein the one or more processors areconfigured to: insert at least one of a plurality of digital watermarks,a plurality of digital fingerprints, or a plurality of triggeridentifiers at a plurality of event opportunities in media content,wherein the media content is associated with a video service providerdifferent from the interactive service provider, wherein the insertionis executed by the interactive service provider via a first integrationwith the video service provider, wherein the media content comprisesprogramming media content and non-programming media content, and whereinthe plurality of event opportunities corresponds to a plurality ofcandidate time intervals in a playback duration of the media content,wherein the client device comprises a processor configured to: detectthe at least one of the inserted plurality of digital watermarks, theinserted plurality of digital fingerprints, or the inserted plurality oftrigger identifiers in the media content, in the playback duration ofthe media content at the client device, wherein the detection isinstructed by the interactive service provider via a second integrationwith the client device; identify, in a registry, information indicatingnext action of the client device based on the at least one of theinserted plurality of digital watermarks, the inserted plurality ofdigital fingerprints, or the inserted plurality of trigger identifiersin the media content and instruction calls raised by a trigger clientsoftware development kit on behalf of the client device; render aplurality of overlay graphics on the media content for the plurality ofcandidate time intervals based on the identified information; activate,via the second integration, at least one rendered overlay graphic of therendered plurality of overlay graphics and one or more input devices invicinity of the client device, wherein the at least one rendered overlaygraphic and the one or more input devices are activated based onprevious service requests for the plurality of event opportunities onwhich the at least one service was previously delivered; present the atleast one rendered overlay graphic as user-selectable option on thesecondary device based on usage statistics of the secondary device andengagement with the at least one rendered overlay graphic on theactivation of the at least one rendered overlay graphic; receive, viathe activated one or more input devices, one or more trigger responsesover the at least one activated overlay graphic in the playbackduration; and display an interactive view on the client device, whereinthe interactive view displays a message and a transfer of a first viewassociated with a selection on the user-selectable option to a secondview associated with a fulfilment server along with a playback of themedia content in the playback duration, wherein the message indicatesthe delivery of the at least one service, and wherein the message andthe transfer of the first view associated with the selection on theuser-selectable option to the second view associated with the fulfilmentserver are displayed based on the playback of the media content in theplayback duration and the received one or more trigger responses,wherein the delivery of the at least one service is enabled at theclient device that is in communication with the fulfillment server, andwherein the fulfillment server fulfills the delivery of the at least oneservice to increase a user engagement to select the at least oneservice.
 2. The system according to claim 1, wherein the one or moreprocessors are further configured to receive, from the video serviceprovider, the media content that comprises the programming media contentand the non-programming media content.
 3. The system according to claim2, wherein the one or more processors are further configured to receivea request, from the video service provider, for analysis of the mediacontent, and wherein the analysis comprises at least one of digitalfingerprinting of the media content or digital watermarking of the mediacontent.
 4. The system according to claim 1, wherein the one or moreprocessors are further configured to: fingerprint the programming mediacontent and the non-programming media content; and generate fingerprintinformation for the programming media content and the non-programmingmedia content, wherein the fingerprint information comprises at leastone of acoustic fingerprint information or video fingerprint informationfor different segments of the programming media content and thenon-programming media content, and wherein the different segmentscomprise at least one of audio frames or image frames of thenon-programming media content.
 5. The system according to claim 1,wherein the plurality of digital watermarks are inserted into at leastone of an audio portion of the media content or a video portion of themedia content at the plurality of event opportunities in the mediacontent.
 6. The system according to claim 5, wherein the one or moreprocessors are further configured to generate watermark information thatcomprises metadata for at least one of an acoustic watermark or a videowatermark inserted between different segments of the media content. 7.The system according to claim 1, wherein each event opportunity of theplurality of event opportunities corresponds to a specified event in themedia content, and the specified event corresponds to at least one ofstart credits, end credits, lull points, time-based marked regions,location-based marked regions, context-based marked regions, explicitscene start and end, or product placement opportunities, within themedia content.
 8. The system according to claim 1, wherein each triggeridentifier of the plurality of trigger identifiers is at least one ofSociety of Cable and Telecom Engineers (SCTE) 35 triggers, playlists,manifest tags, SCTE-104 triggers, Nielsen ID3 tags, inaudible beacons,image beacons, or data beacons.
 9. The system according to claim 1,wherein the processor in the client device is further configured to pairup at least one of the one or more input devices available in vicinityof at least one of the client device or the secondary device, andwherein the at least one of the one or more input devices is paired viaat least one of a Bluetooth network, a Wi-Fi network, an internet-basednetwork, a wired local network, or a wireless ad hoc network.
 10. Thesystem according to claim 1, wherein a virtual voice assistant isenabled on at least one of the one or more input devices, the clientdevice, or the secondary device.
 11. The system according to claim 10,wherein the one or more input devices, enabled with the virtual voiceassistant, are configured to actively wait and listen for a triggeridentifier of the plurality of trigger identifiers in an audio portionof the media content.
 12. The system according to claim 11, wherein theprocessor in the client device is further configured to: instruct thevirtual voice assistant to detect at least one beacon within theplayback duration of the media content; and facilitate a conversationwith a user associated with the client device, to receive a triggerresponse from the user, wherein the trigger response is received througha user speech input.
 13. The system according to claim 1, wherein thenon-programming media content comprises promotional media content for atleast one of a product offering or a service offering, and wherein thepromotional media content comprises at least one of graphical content,textual content, video content, or animated content.
 14. The systemaccording to claim 1, wherein the at least one service comprises: atleast one of a direct payment, a direct transaction, or a directnotification for selected at least one product offering or at least oneservice offering, and a direct update of the selected at least oneproduct offering or the at least one service offering on at least one ofa shopping cart, a personalized list, or a social platform.
 15. Thesystem according to claim 1, wherein the processor in the client deviceis further configured to execute a check to determine whether toactivate the one or more input devices from a set of input devices andthe rendered plurality of overlay graphics.
 16. The system according toclaim 15, wherein the determination is based on the check executed on adefined criteria that is associated with records of the previous servicerequests for the plurality of event opportunities on which the at leastone service was previously delivered, and wherein the defined criteriacomprises at least one of a user-defined constraint, an inventoryconstraint, or a specified threshold count of the one or more triggerresponses for at least one of a product offering or a service offeringpromoted by the non-programming media content.
 17. The system accordingto claim 1, wherein the one or more processors are further configured todeliver at least one notification on at least one of the client deviceor the secondary device paired with the client device, and wherein theat least one notification is delivered based on the received one or moretrigger responses.
 18. The system according to claim 1, wherein the oneor more trigger responses correspond to one or more user interactionsthat are received based on at least one of a touch input, a gestureinput, a haptic feedback input, or a voice command input.
 19. A method,comprising: in a system that comprises an interactive service providerfor delivery of at least one service through at least one of a clientdevice or a secondary device paired with the client device: inserting,by one or more processors of the interactive service provider, at leastone of a plurality of digital watermarks, a plurality of digitalfingerprints, or a plurality of trigger identifiers at a plurality ofevent opportunities in media content, wherein the media content isassociated with a video service provider different from the interactiveservice provider, wherein the insertion is executed by the interactiveservice provider via a first integration with the video serviceprovider, wherein the media content comprises programming media contentand non-programming media content, and wherein the plurality of eventopportunities corresponds to a plurality of candidate time intervals ina playback duration of the media content, detecting, by a processor inthe client device, the at least one of the inserted plurality of digitalwatermarks, the inserted plurality of digital fingerprints, or theinserted plurality of trigger identifiers in the media content, in theplayback duration of the media content at the client device, wherein thedetection is instructed by the interactive service provider via a secondintegration with the client device; identifying, in a registry,information indicating next action of the client device based on the atleast one of the inserted plurality of digital watermarks, the insertedplurality of digital fingerprints, or the inserted plurality of triggeridentifiers in the media content and instruction calls raised by atrigger client software development kit on behalf of the client device;rendering, by the processor in the client device, a plurality of overlaygraphics on the media content for the plurality of candidate timeintervals in the media content based on the identified information;activating, by the processor in the client device, via the secondintegration, at least one rendered overlay graphic of the renderedplurality of overlay graphics and one or more input devices paired withthe client device, wherein the at least one rendered overlay graphic andthe one or more input devices are activated based on previous servicerequests for the plurality of event opportunities on which the at leastone service was previously delivered; presenting the at least onerendered overlay graphic as user-selectable option on the secondarydevice based on usage statistics of the secondary device and engagementwith the at least one rendered overlay graphic on the activation of theat least one rendered overlay graphic; receiving, by the processor inthe client device, via the activated one or more input devices, one ormore trigger responses over the at least one activated overlay graphicin the playback duration; and displaying, by the processor in the clientdevice, an interactive view on the client device, wherein theinteractive view displays a message and a transfer of a first viewassociated with a selection on the user-selectable option to a secondview associated with a fulfilment server along with a playback of themedia content in the playback duration, wherein the message indicatesthe delivery of the at least one service, and wherein the message andthe transfer of the first view associated with the selection on theuser-selectable option to the second view associated with the fulfilmentserver are displayed based on the playback of the media content in theplayback duration and the received one or more trigger responses,wherein the delivery of the at least one service is enabled at theclient device that is in communication with the fulfillment server, andwherein the fulfillment server fulfills the delivery of the at least oneservice to increase a user engagement to select the at least oneservice.
 20. The system according to claim 1, wherein the one or moreprocessors are further configured to receive a request from the videoservice provider that includes information specifying a type ofdetection in the playback duration of the media content at the clientdevice.